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COMPOSITIONS AND METHODS FOR THE TREATMENT 
AND DIAGNOSIS OF BREAST CANCER 



TECHNICAL FIELD 

The present invention relates generally to the detection and therapy of 
5 breast cancer. The invention is more specifically related to nucleotide sequences that 
are preferentially expressed in breast tumor tissue and to polypeptides encoded by such 
nucleotide sequences. The nucleotide sequences and polypeptides may be used in 
vaccmes and phamiaceulical compositions for the prevention and treatment of breast 
cancer. The polypeptides may also he used for the production of compounds, such as 
10 antibodies, useful for diagnosing and monitoring the progression of breast cancer in a 
patient. 



BACKGROUND OF THE INVENTION 

Breast cancer is a significant health problem for women in the United 
States and throughout the world. Although advances have been made in detection and 
15 treatment of the disease, breast cancer remains the second leading cause of cancer- 
related deaths in women, affecting more than 180,000 women in the United States each 
year. For women in North America, the life-time odds of getting breast cancer are now 
one in eight. 

No vaccine or other universally successful method for the prevention or 
20 treatment of breast cancer is currently available. Management of the disease currently 
relies on a combination of early diagnosis (through routine breast screening procedures) 
and aggressive treatment, which may include one or more of a variety of treatments 
such as surgery, radiotherapy, chemotherapy and honnone therapy. The course of 
treatment for a particular breast cancer is often selected based on a variety of prognostic 
25 parameters, including an analysis of specific tumor markers. See, e.g., Porter-Jordan 
and Lippman, Breast 'Cancer <?:73-100 (1994). However, the use of established markers 
often leads to a result that is difficult to interpret, and the high mortality observed in 
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breast cancer palienis indicates that improvements are needed in the treatment, 
diagnosis and prevention ot the disease. 

Accordingly, there is a need in the art for improved methods for therapy 
and diagnosis of breast cancer. The present invention fulfills these needs and further 
5 provides other related advantages. 

SUMMARY OF THE INVENTION 

Briefly stated, the subject invention provides compositions and methods 
■for the diagnosis and therapy of breast cancer. In one aspect, isolated DNA molecules 
arc provided, comprising (a) a nucleotide sequence preferentially expressed in breast 

10 cancer tissue, relative to normal tissue; (b) a variant of such a sequence that contains 
one or more nucleotide substitutions, deletions, insertions and/or modifications at no 
more than 20% (preferably no more than 5"/q) of the nucleotide positions, such that the 
antigenic and/or immunogenic properties of the polypeptide encoded by the nucleotide 
sequence are retained; or (c) a nucleotide sequence encoding an epitope of a polypeptide 

15 encoded by at least one of the above sequences. In one embodiment, the isolated DNA 
molecule comprises a human endogenous retroviral sequence recited in SEQ ID NO:l. 
In other embodiments, the isolated DNA molecule comprises a nucleotide sequence 
recited in any one of SEQ ID NO: 3^26, 28-77, 142, 143, 146-152, 154-166, 168-176, 
178-192, 194-198, 200-204, 206, 207, 209-214, 216, 218, 219, 221-240, 243-245, 247, 

20 250, 251, 253, 255, 257-266, 268, 269, 271-273, 275, 276, 278, 280, 281, 284, 288 and 
291-297. 

In related embodiments, the isolated DNA molecule encodes an epitope 
of a polypeptide, wherein the polypeptide is encoded by a nucleotide sequence that: (a) 
hybridizes to a sequence recited in any one of SEQ ID NO: 1, 3-26, 28-77, 142, 143, 
25 146-152, 154-166, 168-176, 178^192, 194-198, 200-204, 206, 207, 209-214, 216, 218, 
219, 221-240, 243-245, 247, 250, 251, 253, 255, 257-266, 268, 269, 271-273, 275, 276, 
278, 280, 281, 284, 288 and 291-297 under stringent conditions; and (b) is at least SOVc'^ 
identical to a sequence recited in any one of SEQ ID NO: 1, 3-26, 28-77, 142, 143, 146- 
152, 154-166, 168-176, 178-192, 194-198, 200-204, 206, 207, 209-214, 216, 218, 219, 
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221-240, 243-245, 247, 250, 251, 253, 255, 257-266, 268, 269, 271-273, 275, 276, 278, 
280, 281, 284, 288 and 291-297; and wherein RNA corresponding to said nucleotide 
sequence is expressed at a greater level in human breast tumor tissue than in normal 
breast tissue. 

5 In another embodiment, the present invention provides an isolated DNA 

molecule encoding an epitope of a polypeptide, the polypeptide being encoded by: (a) a 
nucleotide sequence transcribed from' the sequence of SEQ ID NO: 141; or (b) a variant 
of said nucleotide sequence that contains one or more nucleotide substitutions, 
"deletions, insertions and/or modifications at no more than 20% of the nucleotide 

10 positions, such that the antigenic and/or immunogenic properties of the polypeptide 
encoded by the nucleotide sequence are retained. Isolated DNA and RNA molecules 
comprising a nucleotide sequence complementary to a DNA molecule as described 
above are also provided. . i 

In related aspects, the present invention provides recombinant expression 

15 vectors comprising a DNA molecule as described above and host cells transformed or 
trdnsfccted with such expression vectors. 

In further aspects, polypeptides, comprising an amino acid sequence 
encoded by a DNA molecule as described above, and monoclonal antibodies that bind 
to such polypeptides are provided. 

20 . : In yet another aspect, nvethods are provided for determinmg the presence 

of breast cancer in a patient. In one embodiment, the method comprises detecting, 
within a biological sample, a polypeptide as described above. In another embodiment, 
the method comprises detecting, within a biological sample, an RNA molecule 
encoding a polypeptide as described above. In yet another embodiment, the method 

25 comprises (a) intradermally injecting a patient with a polypeptide as described above; 
and (b) detecting an immune response on the patient*s skin and therefrom detecting the 
presence of breast cancer in the patient. In further embodiments, the present invention 
provides methods for determining the presence of breast cancer in a patient as described 
above wherein the polypeptide is encoded by a nucleotide sequence selected from the 

30 group consisting of SEQ ID NO: 78-86, 144, 145, 153, 167, 177, 193, 199, 205, 208, 
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215, 217, 220, 241, 242, 240, 24S, 249, 252, 256, 267, 270, 274, 277, 279, 282, 283, 
285-2S7, 2S9, 290 and sequences that hybridize thereto under stringent conditions. 

In a related aspect, diagnostic kits useful in the determination of breast 
cancer are provided. The diagnostic kits generally comprise either one or more 
5 monoclonal antibodies as described above, or one or more monoclonal antibodies that 
bind to a polypeptide encoded by a nucleotide sequence selected from the group 
consisting of sequences provided in SEQ ID NO: 78-86, 144, 145, 153, 167, 177, 193, 
199, 205, 208, 215, 217, 220, 241, 242 and 246, 248, 249, 252, 256, 267, 270, 274, 277, 
279, 282, 2S3, 285-287, 289, 290 and a detection reagent. 

10 Within a related aspect, the diagnostic kit comprises a first polymerase 

chain reaction primer and a second polymerase chain reaction primer, at least one of the 
primers being specific for an RNA molecule described herein. In one embodiment, at 
least one of the primers comprises at least about 10 contiguous nucleotides of an RNA 
molecule as descnbed above, or an RNA molecule encoding a polypeptide encoded by a 

15 nucleotide sequence selected from the group consisting of SEQ ID NO: 78-86, 144, 
145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, 241, 242 246, 248, 249, 252, 
256, 267, 270, 274, 277, 279, 282, 283, 285-287, 289 and 290. 

Within another related aspect, the diagnostic kit comprises at least one 
oligonucleotide probe, the probe being specific for a DNA molecule described herein. 

20 In one embodiment, the probe comprises at least about 15 contiguous nucleotides of a 
DNA molecule as described above, or a DNA molecule selected from the group 
consisting of SEQ ID NO: 78-86, 144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 
217, 220, 241, 242 246, 248, 249, 252, 256, 267, 270, 274, 277, 279, 282, 283, 285-287, 
289 and 290. 

25 In another related aspect, the present invention provides methods for 

monitoring the progression of breast cancer in a patient. In one embodiment, the 
method comprises: (a) detecting an amount, in a biological sample, of a polypeptide as 
described above at a first point in time; (b) repeating step (a) at a subsequent point in 
time; and (c) comparing the amounts of poliTDcptide detected in steps (a) and (b), and 

30 therefrom monitoring the progression of breast cancer in the patient. In another 
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embodiment, the method comprises (a) detecting an amount, within a biological sample, 
ot an RNA molecule encoding a polypeptide as described above at a first point in time; 
(b) repeating step (a) at a subsequent point in time; and (c) companng the amounts of 
RNA molecules detected in steps (a) and (b), and therefrom monitoring the progression 
5 of breast cancer in the patient. In yet other embodiments, the present invention provides 
methods for monitoring the progression of breast cancer in a patient as described above 
wherein the polypeptide is encoded by a nucleotide sequence selected form the group 
consisting of SEQ ID NO: 78-86, 144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 
-217, 220, 241, 242, 246, 248, 249, 252, 256, 267, 270, 274, 277, 279, 282, 283, 285- 

10 2S7, 289, 290 and sequences that hybridize thereto under stringent conditions. 

In still other aspects, pharmaceutical compositions, which comprise a 
polypeptide as described abbve in conibination with a physiologically acceptable 
carrier, and vaccines, which' cotnprise a polypeptide as described above in combination 
with an immune response' ehha'ncer or adjuvant, are provided. In yet other aspects, the 

15 present invention provides phamlaceutical compositions and vaccines comprising a 
polypeptide encoded by b nucleotide sequence selected from the group consisting of 
SEQ ID NO: 78-86, 144, 145,153, 167, 177, 193, 199, 205, 208,215,217, 220, 241, 
242 and 246, 248, 249, 252:256, 267, 270, 274, 277, 279, 282, 283, 285-287, 289, 290 
and sequences that hybridize thereto under stringent conditions. 

20 In related aspects, the present invention provides methods for inhibiting 

the development of breast cancer in a patient, comprising administering to a patient a 
pharmaceutical compdsition or vaccine as described above. 

These and other aspects of the present invention will become apparent 
upon reference to the following detailed description and attached drawings. All 

25 references disclosed herein are hereby incorporated by reference in their entirety as if 
each was incorporated individually. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the differential display PGR products, separated by gel 
electrophoresis, obtained from cDNA prepared from normal breast tissue (lanes 1 and 2) 
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and from cDNA prepared from breast tumor tissue from the same patient (lanes 3 and 
4). The arrow indieates the band corresponding to B 1 8Agl . 

Figure 2 is a northern blot companng the level of BlSAgl mRNA in 
breast tumor tissue (lane 1 ) with the level in normal breast tissue. 
5 Figure 3 shows the level of BlSAgl mRNA in breast tumor tissue 

compared to that in various normal and non-breast tumor tissues as determined by 
RNase protection assays. 

Figure 4 is a genomic clone map showing the location of additional 
-retroviral sequences obtained from ends of Xbal restriction digests (provided in SFQ ID 
10 NO:3 - SEQ ID NO: 10) relative to BlSAgl. 

Figures 5 A and 5B show the sequencing strategy, genomic organization 
and predicted open reading frame for the retroviral element containing BlSAgl. 

Figure 6 shows the nuc<eotide sequence of the representative breast 
tumor-spec i tic cDNA BlSAgl. 
15 Figure 7 shows the nucleotide sequence of the representative breast 

tumor-specific cDNAElTAgl. 

Figure 8 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B17Ag2. 

Figure 9 shows the nucleotide sequence of the representative breast 
20 tumor- speci fic cDN A B 1 3 Ag2a. 

Figure 10 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B13Aglb. 

Figure 1 1 shows the nucleotide sequence of the representative breast 
tumor-specific cDNA B13Agla. 
25 Figure 12 shows the nucleotide sequence of the representative breast 

tumor-specific cDNA BllAgl. 

Figure 13 shows ihe nucleotide sequence of the representative breast 
tumor- specific cDNA B3CA3c. 

Figure 14 shows the nucleotide sequence of the representative breast 
30 tumor-specific cDNA B9CG1 . 
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Figure 15 shows the nucleotide sequence of the representative breast 
tumor- specific cDNA B9CG3. 

Figure 16 shows the nucleotide sequence of the representative breast 

tumor-specific cDNA B2CA2. 
5 Figure 17 shows the nucleotide sequence of the representative breast 

tumor-specific cDNA B3CAI . 

Figure 18 shows the nucleotide sequence of the representative breast 

tumor-spccific cDNA B3CA2. 

Figure 19 shows the nucleotide sequence of the representative breast 
10 tumor-spccific cDNA B3CA3. 

Figure 20 Ishows^ the nucleotide sequence of the representative breast 
. tumor-specific cDN A B4CAh.: 

Figure 21 Ai depicts RT-PCR analysis of breast tumor genes in breast 
tumor tissues (lanes 1-S) and normal breast tissues (lanes 9-13) and FLO (lane 14). 
15 Figure 21 B depicts RT-PCR analysis of breast tumor genes in prostate 

tumors (lane 1, 2), colon tumors (lane 3), lung tumor (lane 4), normal prostate (lane 5), 
normal colon (lane 6), normal kidney (lane 7), normal liver (lane 8), normal lung (lane 
9), normal ovary (lanes 10, 18), normal pancreases (lanes 11, 12), normal skeletal 
muscle (lane 13), normal skin (lane 14), normal stomach (lane 15), normal testes (lane 
20 16), normal small intestine (lane 17), HBL-100 (lane 19), MCF-12A (lane 20), breast 
tumors (lanes 21-23), H.O (lane 24), and colon tumor (lane 25). 

DETAILED DESCRIPTION OF THE INVENTION 

As noted above, the present invention is generally directed to 
compositions and methods for the diagnosis, monitoring and therapy of breast cancer. 
25 The compositions described herein include polypeptides, nucleic acid sequences and 
antibodies. Polypeptides of the present invention generally comprise at least a portion 
of a protein that is expressed at a greater level in human breast tumor tissue than in 
normal breast tissue (/.e., the level of RNA encoding the polypeptide is at least 2-fold 
higher in tumor tissue). Such polypeptides are referred to herein as breast tumor- 
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specific polypeptides, and cDNA molecules encoding such polypeptides are referred to 
as breast tumor-specific cDNAs. Nucleic acid sequences of the subject invention 
generally comprise a DNA or RNA sequence thai encodes all or a portion of a 
polypeptide as described above, or that is complementary to such a sequence. 
5 Antibodies are generally immune system proteins, or fragments thereof, that are capable 
of binding to a portion of a polypeptide as described above. Antibodies can be 
produced by cell culture techniques, including the generation of monoclonal antibodies 
as described herein, or via transfection of antibody genes into suitable bacterial or 
-mammalian cell hosts, in order to allow for the production of recombinant antibodies. 
10 Polypeptides within the scope of this invention include, but are not 

limited to, polypeptides (and epilopcs thereof) encoded by a human endogenous 
retroviral sequence, such as the sequence designated BlSAgl (Figure 5 and SEQ ID 
NO:l). Also within the scope of the present invention arc polypeptides encoded by 
other sequences within the retroviral genome containing BlSAgl (SEQ ID NO: 141), 
15 Such sequences include, but are not limited to, the sequences recited in SEQ ID NO:3 - 
SEQ ID NO: 10. BlSAgl has homology to the gag p30 gene of the endogenous human 
retroviral element S71, as described in Werner et al.. Virology 7?*/:225-238 (1990) and 
also shows homology to about thirty other retroviral gag genes. As discussed in more 
detail below% the present invention also includes a number of additional breast tumor- 
20 specific polypeptides, such as those encoded by the nucleotide sequences recited m 
SEQ ID NO: 11-26, 28-77, 142, 143, 146-152, 154-166, 16S^176, 178-192, 194-198, 
200-204, 206: 207, 209-214, 216, 218, 219, 221-240, 243-245, 247, 250, 251, 253, 255, 
257-266, 268, 269, 271-273, 275, 276, 278, 280, 281, 284, 288 and 291-297. As used 
herein, the term ''polypeptide'' encompasses amino acid chains of any length, including 
25 full length proteins containing the sequences recited herein. A polypeptide comprising 
an epitope of a protein containing a sequence as described herein may consist entirely of 
the epitope, or may contain additional sequences. The additional sequences may be 
derived from the native protein or may be heterologous, and such sequences may (but., 
need not) possess immunogenic or antigenic properties. 
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An "epitope,'' as used herein is a portion of a polypeptide that is 
recognized (i.e., specifically bound) by a B-cell and/or T-cell surface antigen receptor. 
Epitopes may generally be identified using well known techniques, such as those 
summarized in Paul, Fundamental Immunology^ 3rd ed., .243-247 (Raven Press, 1993) 
5 and references cited therein. Such techniques include screening polypeptides derived 
from the native polypeptide for the ability to react with antigen-specific antiscra and^or 
T-cell lines or clones. An epitope of a polypeptide is a portion that reacts with such 
antisera and/or T-cells at a level that is similar to the reactivity of the full length 
.polypeptide {e.^,, in an ELISA and/or T-cell reactivity assay). Such screens may 
10 generally be performed using methods well known to those of ordinarv skill in the art, 
such as those described in Hariow and Lane, Antibodies: A Laboratoiy ManuaU Cold 
Spnng Harbor Laboratory, 1988. B-cell and T-cell epitopes may also be predicted via 
computer analysis;: Polypeptides comprising an epitope of, a polypeptide that is 
preferentially expressed in a : tumor tissue (with or . without additional amino acid 
1 5 sequence) are within the scope of the present invention. 

The compositions and methods of the present invention also encompass 
variants of the . above polypeptides and nucleic acid , sequences encoding such 
polypeptides. A polypeptide "variant," as used herein, is a polypeptide that differs from 
the native polypeptide in substitutions and/or modifications^ such that the antigenic 
20 and/or immunogenic properties of the polypeptide are retained. Such variants may 
generally be identified by modifying one of the above polypeptide sequences and 
evaluating the reactivity of the modified polypeptide with antisera and/or T-cells as 
described above. Nucleic acid variants may contain one or more substitutions, 
deletions, insertions and/or modifications such that the antigenic and/or immunogenic 
25 properties of the encoded polypeptide are retained. One preferred variant of the 
polypeptides described herein is a variant that contains nucleotide substitutions, 
deletions, insertions and/or modifications at no more than 20% of the nucleotide 
positions. 

Preferably, a variant contains conservative substitutions. A 
30 "conservative substitution" is one in which an amino acid is substituted for another 
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10 

amino acid thai has similar properties, such that one skilled in the art of peptide 
chemistry would expect the secondary stmcture and hydropathic nature of the 
polypeptide to be substantially unchanged. In general, the following groups of amino 
acids represent conservative changes: (l)ala, pro, gly, glu, asp, gin, asn, ser, thr; 
5 (2) cys, ser, tyr, thr; (3) val, ile, leu, met, ala, phe; (4) lys, arg, his; and (5) phc, tyr, trp, 
his. 

Variants may also (or alternatively) be modified by, for example, the 
deletion or addition of amino acids that have minimal influence on the immunogenic or 
antigenic properties, secondary structure and hydropathic nature of the polypeptide. For 
10 example, a polypeptide may be conjugated to a signal (or leader) sequence at the N- 
terminal end of the protein which co-lranslationally or post-lranslationally directs 
transfer of the protein. The polypeptide may also be conjugated to a linker or other 
sequence for ease of synthesis, purification or identification of the polypeptide {e,g., 
poly-His), or to enhance binding of the polypeptide to a solid support. For example, a 

15 polypeptide may be conjugated to an immunoglobulin Fc region. 

In general, nucleotide sequences encoding all or a portion of the 
polypeptides described herein may be prepared using any of several techniques. For 
example, cDNA molecules encoding such polypeptides may be cloned on the basis of 
the breast tumor-specific expression of the corresponding niRNAs, using differential 

20 display PCR. This technique compares the amplified products from RNA template 
prepared from normal and breast tumor tissue. cDNA may be prepared by reverse 
transcription of RNA using a (dT),2AG primer. Following amplification of the cDNA 
using a random primer, a band corresponding to an amplified product specific to the 
tumor RNA may be cut out from a silver stained gel and subcloned into a suitable 

25 vector {e.g., the T-vector, Novagen, Madison, WI). Nucleotide sequences encoding all 
or a portion of the breast tumor-specific polypeptides disclosed herein may be amplified 
from cDNA prepared as described above using the random primers shown in SEQ ID 
NO.:87-125. 

Alternatively, a gene encoding a polypeptide as described herein (or a 
30 portion thereof) may be amplified jfrom human genomic DNA, or from breast tumor 
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cDNA, via polymerase chain reaction. For this approach, B18Agl sequence-specific 
primers may be designed based on the sequence provided in SEQ ID NO:l, and may be 
purchased or synthesized. One suitable primer pair for amphfication from breast tumor 
cDNA is (5'ATG GCl ATT TTC GGG GGC TGA CA) (SEQ ID NO.: 126) and 
5 (5'CCG GTA TCT CCT CGT GGG TAT T) (SEQ ID NO.:127). An amplified portion 
of BlSAgl may then be used to isolate the full length gene from a human genomic 
DNA library or from a breast tumor cDNA library, using well known techniques, such 
as those described in Sambrook et al.. Molecular Cloning: A Lahorator\> Manual, Cold 
Spring Harbor Laboratories, Cold Spring Harbor, NY (1989). Other sequences within 
10 the retroviral genome of which BISAgl is a part may be similarly prepared by 
screening human genomic hbraries using BlSAgl -specific sequences as probes. 
Nucleotides translated into protein from the retroviral genome shown in SEQ ID NO: 
141 may then be determined by cloning the corresponding cDNAs, predicting the open 
reading frames and cloning the appropriate cDNAs into a vector containing a viral 
15 promoter, such as T7. The resulting constructs can be employed in a translation 
reaction, using techniques known to those of skill in the art, to identify nucleotide 
sequences which result in expressed protein. Similarly, pnmers specific for the 
remaining breast tumor-specific polypeptides described herein may be designed based 
on the nucleotide sequences provided in SEQ ID NO: 11 - SEQ ID NO:86 and SEQ ID 
20 NO:142r SEQ IDNO:297, 

Recombinant polypeptides encoded by the DNA sequences described 
above may be readily prepared from the DNA sequences. For example, supematants 
from suitable host/vector systems which secrete recombinant protein or polypeptide into 
culture media may be first concentrated using a commercially available filter. 
25 Following concentration, the concentrate may be applied to a suitable purification 
matrix such as an affinity matrix or an ion exchange resin. Finally, one or more reverse 
phase HPLC steps can be employed to further purify a recombinant polypeptide. 

In general, any of a variety of expression vectors known to those of, 
ordinary skill in the art may be employed to express recombinant polypeptides of this 
30 mvention. Expression may be achieved in any appropriate host cell that has been 
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transformed or transfecled with an expression vector containiniz a DNA molecule that 
encodes a recombinant polypeptide. Suitable host cells include prokaryotes, yeast and 
higher eukaryotic cells. Preferably, the host cells employed are E. coli, yeast or a 
mammalian cell line such as COS or CHO. 
5 Such techniques may also be used to prepare polypeptides comprising 

epitopes or variants of the native polypeptides. For example, variants of a native 
polypeptide may generally be prepared using standard mutagenesis techniques, such as 
oligonuclcolide-directed site-specific mutagenesis, and sections of the DNA sequence 
may be removed to permit preparation of truncated polypeptides. Portions and other 
10 vanants having fewer than about 100 amino acids, and generally fewer than about 50 
amino acids, may also be generated by synthetic means, using techniques well known to 
those of ordinary skill m the art. For example, such polypeptides may be synthesized 
using any of the commercially available solid-phase techniques, such as the Merrifield 
solid-phase synthesis method, where ammo acids are sequentially added to a growing 
15 amino acid chain. See Merrifield,/. Am, Chem. Soc. 5.>:2 149-2 146 (1963). Equipment 
for automated synthesis of polypeptides is commercially available from suppliers such 
as Perkin Elmer/ Applied BioSysterns Division,, Foster City, CA, and may be operated 
according to the manufacturer's instructions. 

In specific embodiments, polypeptides of the present invention 
20 encompass amino acid sequences encoded by a DNA molecule having a sequence 
recited in any one of SEQ ID NO: 1, 3-26, 28-77, 142, 143, 146-152, 154-166, 168-176, 
178-192, 194-198, 200-204, 206, 207, 209-214, 216, 218, 219, 221-240, 243-245, 247, 
250, 251. 253, 255, 257-266, 268, 269, 271-273, 275, 276, 278, 280, 281, 284, 288 and 
291-297, variants of such polypeptides that are encoded by DNA molecules containing 
25 one or more nucleotide substitutions, deletions, insertions and/or modifications at no 
more than 20% of the nucleotide positions, and epitopes of the above polypeptides. 
Polypeptides within the scope of the present invention also include polypeptides (and 
epitopes thereof) encoded by DNA sequences that hybridize to a DNA molecule having 
a sequence recited in any one of SEQ ID NO:l, 3-26, 28-77, 142, 143, 146-152, 154- 
30 166, 168-176, 178-192, 194-198, 200-^204, 206, 207, 209-214, 216, 218, 219, 221-240, 
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243-245, 247, 250, 251, 253, 255, 257-266, 268. 269, 271-273, 275, 276, 278, 280, 281, 
284, 288 and 291-297 under stringent conditions, wherein the DNA sequences arc at 
least 80% identical ui overall sequence to a recited sequence and wherein RNA 
corresponding to the nucleotide sequence is expressed at a greater level in human breast 
5 tumor tissue than in normal breast tissue. As used herein, "stringent conditions" refers 
to prewashing in a solution of 6X SSC, 0.2% SDS; hybridizing at 65°C, 6X SSC. 0.2% 
SDS overnight; followed by two washes of 30 minutes each in IX SSC, 0.1% SDS at 
65°C and two washes of 30 minutes each in 0.2 X SSC, 0.1% SDS at 65°C. DNA 
- molecules according to the present invention include molecules that encode any of the 
10 above polypeptides. ' 

In another aspect of the present invention, amibodies are provided. Such 
antibodies may be prepared by any . of a variety of technicjues known to those of 
ordinar>' skill in the art. See; e.g., Harlow and Lane, Antibodies: A Lahoraioiy Manual, 
Cold Spring Harbor Laboratory, ' 1988.- In one such technique, an immunogen 
15 comprising the polypeptide is initially injected into any of a wide variety of mammals 
{e.g.. mice, rats, rabbits, .sheep. or goats). In this step, the polypeptides of this invention 
may serve as the immunogen without modification. Alternatively, particularly for 
relatively short polypeptides, a superior immune response may be elicited if the 
polypeptide is joined to a carrier protein, such as bovine serum albumin or keyhole 
20 limpet hemocyanin. The immunogen is injected into the animal host, preferably 
according to a predetermined schedule incorporating one or more booster 
immunizations, and the animals are bled periodically. Polyclonal antibodies specific for 
the polypeptide may then be purified from such antisera by, for example, affinity 
chromatography using the polypeptide coupled to a suitable solid support. 

Monoclonal antibodies specific for the antigenic polypeptide of interest 
may be prepared, for example, using the technique of Kohler and Milstein, Eur. J. 
Immunol. 6:51 1-519 (1976); improvements thereto. Briefly, these methods involve 
the preparation of immortal cell lines capable of producing antibodies having the. 
desired specificity {i.e., reactivity with the polypeptide of interest). Such cell lines may 
be produced, for example, from spleen cells obtained from an animal immunized as 
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described above. The spleen cells are then immonalized by, for example, fusion with a 
myeloma cell fusion partner, preferably one that is syngeneic with the immunized 
animal. A variety effusion techniques may be employed. For example, the spleen cells 
and myeloma cells may be combined with a nonionic detergent for a few minutes and 
5 then plated at low density on a selective medium that supports the growth of hybrid 
ceils, but not myeloma cells. A preferred selection technique uses HAT (hypoxanthine, 
aminopterin, thymidine) selection. After a sufficient time, usually about 1 to 2 weeks, 
colonies of hybrids are observed. Single colonies are selected and their culture 
supematants tested for binding activity against the polypeptide. Hybndomas having 

1 0 high reactivity and specificity are preferred. 

Monoclonal antibodies may be isolated from the supematants of growing 
hybridoma colonies. In addition, various techniques may be employed to enhance the 
yield, such as injection of the hybridoma cell line into the peritoneal cavity of a suitable 
vertebrate host, such as a mouse. Monoclonal antibodies may then be harvested from 

15 the ascites fluid or the blood. Contaminants may be removed from the antibodies by 
conventional techniques, such as chromatography, gel filtration, precipitation, and 
extraction. The polypeptides of this invention may be used in the purification process 
in, for example, an affinity chromatography step. 

Antibodies may be used, for example, in methods for detecting breast 

20 cancer in a patient. Such methods involve using an antibody to detect the presence or 
absence of a breast tumor-specific polypeptide as descnbed herein in a suitable 
biological sample. As used herein, suitable biological samples include tumor or normal 
tissue biopsy, mastectomy, blood, lymph node, serum or urine samples, or other tissue, 
homogenate, or extract thereof obtained from a patient. 

25 There are a variety of assay formats known to those of ordinary skill in 

the art for using an antibody to detect polypeptide markers in a sample. See^ e.g.^ 
Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 
1988. For example, the assay may be performed in a Western blot format, wherein ^j. 
protein preparation from the biological sample is submitted to gel electrophoresis, 

30 transferred to a suitable membrane and allowed to react with the antibody. The 
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presence of the antibody on the membrane may then be detected using a suitable 
detection reagent, as described below. 

In another embodiment, the assay involves the use of antibody 
immobilized on a solid support to bmd to the polypeptide and remove it from the 
5 remainder of the sample. The bound polypeptide may then be detected using a second 
antibody or reagent that contains a reporter group. Alternatively, a competitive assay 
may be utilized, in which a polypeptide is labeled with a reporter group and allowed to 
bind to the immobilized antibody after incubation of the antibody with the sample. The 
- extent to which components of the sample inhibit the binding of the labeled polypeptide 
10 to the antibody is indicative of the reactivity of the sample with the immobilized 
antibody, and as a result, mdicative of the concentration of polypeptide in the sample. 

The solid support may be any material known to those of ordinary skill 
in the art to whicK the antibody may be attached. For example, the solid support may be 
a lest well" in a microtiter piate or a nitrocellulose filter or other suitable membrane. 
15 Alternatively, the support may be a bead or disc, such as glass, fiberglass, latex or a 
plastic material such as polystyrene or polyvinylchlonde. The support may also be a 
magnetic particle or a fiber optic sensor, such as those disclosed, for example, in U.S. 
Patent No. 5,359,681. 

The antibody may be immobilized on the solid support using a variety of 
20 techniques known to those in the art, which are amply described in the patent and 
scientific literature. In the context of the present invention, the term "immobilization" 
refers to both noncovalent association, such as adsorption, and covalent attachment 
(which may be a direct linkage between the antigen and functional groups on the 
support or may be a linkage by way of a cross-linking agent). Immobilization by 
25 adsorption to a well in a microtiter plate or to a membrane is preferred In such cases, 
adsorption may be achieved by contacting the antibody, in a suitable buffer, with the 
sohd support for a suitable amount of time. The contact lime varies with temperature, 
but is typically between about 1 hour and 1 day. In general, contacting a well of a . 
plastic microtiter plate (such as polystyrene or polyvinylchloride) with an amount of 
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antibody ranging from about 10 ng to about 1 jag, and preferably about 100-200 ng, is 
sufficient to immobilize an adequate amount of polypeptide. 

Covalent attachment of antibody to a solid support may also generally be 
achieved by first reacting the support with a bifunctional reagent that will react with 
5 both the support and a functional group, such as a hydroxyl or amino group, on the 
antibody. For example, the antibody may be covalently attached to supports having an 
appropriate polymer coating using bcnzoquinone or by condensation of an aldehyde 
group on the support with an amine and an active hydrogen on the binding partner {see^ 
- c\g.. Pierce Immunotechnology Catalog and Handbook (1991) at A12-A13). 

10 In certain en.bodiments, the assay for detection of polypeptide in a 

sample is a two-antibody sandwich assay. This assay may be perfomied by first 
contacting an antibody that has been immobilized on a solid support, commonly the 
well of a microiuer plate, v^ith the biological sample, such that the polypeptide within 
the sample are allowed to bind lo the immobilized antibody. Unbound sample is then 

15 removed from the immobilized polypeptidc-antibody complexes and a second antibody 
(containing a reporter group) capable of binding to a different site on the polypeptide is 
added- The amount of second antibody that remains bound to the solid support is then 
detennined using a method appropriate for the specific reporter group. 

More specifically, once the antibody is immobilized on the support as 

20 described above, the remaining protein binding sites on the support are typically 
blocked. Any suitable blocking agent known to those of ordinary skill in the art, such 
as bovine scrum, albumin or Tween 20™ (Sigma Chemical Co., St. Louis, MO). The 
immobilized antibody is then incubated with the sample, and polypeptide is allowed to 
bind to the antibody. The sample may be diluted with a suitable diluent, such as 

25 phosphate-buffered saline (PBS) prior to incubation. In general, an appropriate contact 
time {i.e., incubation time) is that penod of time that is sufficient to detect the presence 
of polypeptide within a sample obtained from an individual with breast cancer. 
Preferably, the contact tune is sufficient to achieve a level of binding that is at leasts, 
95% of that achieved at equilibrium between bound and unbound polypeptide. Those of 

30 ordinary skill in the art will recognize that the time necessary to achieve equilibrium 
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may be readily dccrmined by assaying *e level of binding .ha. oce.rs over a period of 
■.me. A. roon, .en,pera.ure. an incuba.ion .nnc of abou. 30 minu.es ,s senerally 
sufficient. 



Unbound sample may .hen be removed by washing ,he solid support 
5 w„h an appropriate buffer, such as PBS eoMa.ning 0.1% Tween 20™ The second 
an.,body, „h,ch contains a reponer group, n,ay then be added to the solid support 
Preferred reporter groups include enzymes (such as hot^eradish peroxidase,, substrates 
cofacors, ,„hibi,ors, dyes,- radionuclides, luminescent groups, Huorescen. groups and 
- b,ot.„. The conjugation of amibody to reporter group ™ay be achieved usina standard 
1 0 methods known to those of ordinary skill in (he an. 

The second antibody is then incubated wi.h , he imuwbihzed an.ibodv- 
polypep.,de cot^plex for an amoun, of „n,e sufficien, to detect the bound polypeptide 
A„ appropnatc amotint of .,n,b may generally be de.entrtned by assaying .he level of 
b,„d,„g .ha, occurs over a period of ti„e. Unbound second antibody is then removed 
and hound second antibody is detected- using the reportergroup. The method employed 
for detecing the reporter group depends upon the nature of the reporter group For 
rad,oae„ve groups, scin.illa.ion Counting or autoradiogntphic methods are generally 
appropnatc. Spectroscopic methods may be used to detect dyes, lumtnescen. Groups 
and fluorescent groups. Biotin n,ay be deteced using av.din, coupled ,o a dtfferen. 
reporter group (commonly a rad.oac.ivc or nuorescent group or an enzyme). Enzyme 
reponer groups may generally be detected by .he addi.ion of subsmt.e (genentlly fo, a 
spec,f,c penod of .i„e), followed by specttoscopic or other analysis of the reaction'' 
products. . 

To deterirtne the presence or absence of bteas. cancer, the signal 
-5 detected ftom dte reporter group that remains bound to the solid suppor. is generally 
compared ,o a signal that corresponds to a prede,en.,ned cut-off value establtshed from 
non-,un,or tissue. In one preferred embodiment, the c«-off value is .he average mean 
stgnal ob,ai„ed when ,he imtnobilized antibody is incubated with samples from patients, 
wthout breast cancer ,n general, a sample generating a signal to, is three standard ' 
dev,a.,ons above the prede^rmined cu.-offvalue may be considered positive for breast 
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cancer. In an alternate preferred embodiment, the cut-off value is determined using a 
Receiver Operator Curx'c, according to the method of Sackett et al.. Clinical 
Epidemiology: A Basic Science for Clinical Medicine^ p. 106-7 (Little Brown and Co., 
1985). Briefly, m this embodiment, the cut-off value may be determined from a plot of 
5 pairs of true positive rates (/.e., sensitivity) and false positive rates (100%-specificity) 
that correspond to each possible cut-off value for the diagnostic test result. The cut-off 
value on the plot that is the closest to the upper left-hand comer the value that 
encloses the largest area) is the most accurate cut-off value, and a sample generating a 
-signal that is higher than the cut-off value determined by this method may be considered 

10 positive. Alternatively, the cut-off value may be shifted to the left along the plot, to 
minimize the false positive rate, or to the right, to minimize the false negative rate, in 
general, a sample generating a signal that is higher than the cut-off value determined by 
this method is considered positive for breast cancer. 

In a related embodiment, the assay is performed in a flow-through or 

15 strip test format, wherein the antibody is immobilized on a membrane, such as 
nitrocellulose. In the flow-through test, the polypeptide within the sample bind to the 
immobilized antibody as the sample passes through the membrane. A second, labeled 
antibody then binds to the antibody-polypeptidc complex as a solution containing the 
second antibody flows through the membrane. The detection of bound second antibody 

20 may then be performed as described above. In the strip test fonnat, one end of the 
membrane to which antibody is bound is immersed in a solution containing the sample. 
The sample migrates along the membrane through a region containing second antibody 
and to the area of immobilized antibody. Concentration of second antibody at the area 
of immobilized antibody indicates the presence of breast cancer. Typically, the 

25 concentration of second antibody at that site generates a pattern, such as a line, that can 
be read visually. The absence of such a pattern indicates a negative result. In general, 
the amount of antibody immobilized on the mem.brane is selected to generate a visually 
discernible pattern when the biological sample contains a level of polypeptide that., 
would be sufficient to generate a positive signal in the two-antibody sandwich assay, in 

30 the formal discussed above. Preferably, the amount of antibody immobilized on the 
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n.e,nbn,ne ranges fro. abou, 25 „, ,o abou, , and ™orc preferably fron, abou, 50 
,o abou, , such ,e.s can „ be perfonne. wl,b a very s„,a„ a.ou„, „f 

biological sample. 

^''-"« °r cancer in a pa,ie„, may also be 

<ie,e™,ned by eva,ua.in« ,„e leve, of .RNA encoding a breas, ,un,or-speci«c 
polypepHdc as descnbed bare,,, wi,hi„ ,be biologica, sample a biopsy, 

n.as,ec,o„,y and/or biood samp,. f,.m a pa.ien., .,a.,ve .o a pred=,e™i„ed cu.-off 
value. Sucb an evalua.ion may be achieved using any of a vane.y of methods known .„ 
-.hose Of ordinary, s.,11 in-, he ar. such as, for example, ,„ .,,„ hybnd,za..on and 
10 amplification by polymerase chain reaction. 

For example; polymerase chain reaction may be used to amplify 
sequences from eDNA prepared fr5m RNA that ,s isolated from one of the above 
biological samples. Se,uence-speeif,c primers for use ,„ such amplification may be 
designed based on the sequences pr^ovtded in any one of SEQ ,D NO: 1, 1 ,-S6 and 14.- 
297. and may be purebased or synthesized. ,„ the case of B.SAgl, as noted herein one 
suitable primer pair ,s BI8Agl.2 (5'ATG GCT ATT TTC GGG GCC TGA CA) (SEO 
.D NO.:126, and B,8Ag,-3 (5-CCO GTA TCT CCT CGT GGG TAT T) ,SEQ ,D 
NO.: ,27). The PGR reaction products may then be separated by gel electrophoresis and 
_ visua ized accordtng to m«hods well known to those of ordinary ski,, in the a„ 
20 Amplification is typically performed on samples ob.ained from matched pairs oftissue 
(tumor and „o„-tumor tissue from the same individual, or from unmatched pait. of 
..-ue (tumor and „on-tun,or tissue fi-om differ„t individuals). The amplification 
reaetton is preferably perfonned on several dtlutions of cDNA spanning two orders of 
magnitude. A ,wo-fo,d or greaier increase in expression in severa, ddutions of the 
■umor sample as compared to^the same dilution of the non-tumor sample is considered 
positive. 

As used herein, the term '-primer/probe specific for a DNA/RNA 
mo ecule" means an oligonucleotide sequence .hat has a, leas, abou, 80% idenri,y 

30 OhZV' 'T t""' """" « - «%. identity ,o the 

DNA/RNA molecule ,n question. Primers and/or probes which may be usefufiy 



25 

positive 
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employed in the inventive diagnostic methods preferably have at least about 10-40 
nucleotides. hi a preferred embodiment, the polymerase chain reaction primers 
comprise at least about 10 contiguous nucleotides of a DNA/RNA molecule encoding 
one of the polypeptides disclosed herein. Preferably, oligonucleotide probes for use in 
5 the inventive diagnostic methods comprise at least about 15 contiguous 
oligonucleotides of a DNA/RNA molecule encoding one of the polypeptides disclosed 
herein. Techniques for both PCR based assays and in situ hybridization assays are well 
known in the art. 

Conventional RT-PCR protocols using agarose and ethidium bromide 

10 staining while important in defming gene specificity do not Jcnd themselves to 
diagnostic kit development because of the time and effort required in making them 
quantitative (i.e., construction of saturation and/or titration curves), and their sample 
tliroughput. This problem is overcome by the development of procedures such as real 
time RT-PCR which allows for assays lo be performed in single tubes, and in turn can 

15 be modified for use in 96 well plate formats. Instrumentation to perform such 
methodologies are available from Perkin Elmer/Applied Biosystems Division. 
Alternatively, other high throughput assays using labeled probes (e.g., digoxygenin) in 
combination with labeled (e.g., enzyme fluorescent, radioactive) antibodies to such 
probes can also be used in the development of 96 well plate assays. 

20 In yet another method for determining the presence or absence of breast 

cancer in a patient, one or more of the breast tumor-specific polypeptides described 
may be used in a skin test. As used herein, a "skin test" is any assay performed directly 
on a patient in which a delayed-type hypersensitivity (DTH) reaction (such as swelling, 
reddening or demiatitis) is measured following intradermal injection of one or more 

25 polypeptides as described above. Such injection may be achieved using any suitable 
device sufficient to contact the polypeptide or polypeptides with dermal cells of the 
patient, such as a tuberculin syringe or 1 mL syringe. Preferably, the reaction is 
measured at least 4S hours after injection, more preferably 48-72 hours. 

The DTH reaction is a cell-mediated immune response, which is greater 

30 in patients that have been exposed previously to a test antigen {i.e., an immunogenic 
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portion of a polypeptide employcil, or a variant thcreoO. The response may measured 
visually, using a ruler. In general, a response that is greater than about 0.5 cm in 
diameter, preferably greater than about 5.0 cm in diameter, is a positive response, 
indicative of breast cancer. 
5 The breast tumor-specific polypeptides described herein are preferably 

formulated, for use in a skin test, as pharmaceutical compositions containing at least 
one polypeptide and a physiologically acceptable carrier, such as water, saline, alcohol, 
or a buffer. Such compositions typically contain one or more of the above polypeptides 
in an amount ranging from about K^g to 100 >tg, preferably from about 10 |ag to 50 |.ig 
10 in a volume of 0.1 mL. Preferably, the . carrier employed in such pharmaceutical 
compositions is a saline solution with appropriate preservatives, such as phenol and/or 
Tween 80™. 

, In other aspects of the present invention, the progression and/or response 
to treatment of a breast cancer may be monitored by performing' any of the above assays 

15 over a period of time, and evaluating the change in. the level of the response (i.e., the 
amount of polypeptide or mRNA detected or, in the case of a skin test, the extent of the 
immune response detected). For example, the assays may be performed every month to 
every other month for a period of 1 to 2 years. In general, breast cancer is progressing 
in those patients in whom the level of the response' increases over time. In contrast, 

20 breast cancer is not progressing when the signal detected either remains constant or 
decreases with time. - . 

In further aspects of the present invention, the compounds described 
herein may be used for the immunotherapy of breast cancer.. In these aspects, the 
compounds (which may be polypeptides, antibodies or nucleic acid molecules) are 

25 preferably incorporated into pharmaceutical compositions or vaccines. Pharmaceutical 
compositions comprise one or more such compounds and a physiologically acceptable 
carrier. Vaccines may comprise one or more polypeptides and an immune response 
enhancer, .such as an adjuvant or a liposome (into which the compound is incorporated) 
Pharmaceutical compositions and vaccines may additionally contain a delivery system, 

30 such as biodegradable microspheres which are disclosed, for example, in U.S. Patent 
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Nos. 4,897.268 and 5,075,109. Pharmaceutical compositions and vaccines within the 
scope of the present invention may also contain other compounds, including one or 
more separate polypeptides. 

Alternatively, a vaccine may contain DNA encoding one or more of the 
5 polypeptides as described above, such thai the polypeptide is generated in situ. In such 
vaccines, the DNA may be present within any of a variety of delivery systems known to 
those of ordinary skill in the art, including nucleic acid expression systems, bacteria and 
viral expression systems. Appropriate nucleic acid expression systems contain the 
- necessary DNA sequences for expression in the patient (such as a suitable promoter and 

10 terminating signal). Bacterial delivery systems involve the administration of a 
bacterium (such as Bacilliis-Calmene-Guerrin) that expresses an immunogenic portion 
of the polypeptide on its cell surface. In a preferred embodiment, the DNA may be 
mtroduced using a viral expression system {e.g., vaccinia or other pox virus, retrovirus, 
or adenovirus), which may involve the use of a iion- pathogenic (defective), replication 

1 5 competent virus. Techniques for incorporating DNA into such expression systems are 
we!) known to those of ordinary skill in the art. The DNA may also be "naked," as 
described, for example, in Ulmer et al.. Science 259:1745-1749 (1993), and reviewed by 
Cohen, Science 259:1691-1692 (1993). The uptake of naked DNA may be increased by 
coating the DNA onto biodegradable beads, which are efficiently transported into the 

20 cells. 

While any suitable carrier known to those of ordinary skill in the art may 
be employed in the pharmaceutical compositions of this invention, the type of carrier 
will vary depending on the mode of administration. For parenteral administration, such 
as subcutaneous injection, the carrier preferably comprises water, saline, alcohol, a fat, a 
25 wax or a buffer. For oral administration, any of the above carriers or a solid carrier, 
such as mannitol, lactose, starch, magnesium stearate, sodium saccharine, talcum, 
cellulose, glucose, sucrose, and magnesium carbonate, may be employed. 
Biodegradable microspheres {e.g , polylactate polyglycolate) may also be employed as-, 
carriers for the pharmaceutical compositions of this invention. 
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Any of a variety of adjuvants may be employed in the vaccines of this 
invention to nonspecificaliy enhance the immune response. Most adjuvants contain a 
substance designed to protect the antigen from rapid cataboiism, such as aluminum 
hydroxide or mineral oil, and a stimulator of immune responses, such as lipid A, 
5 Bortadella pertussis or Mycohaciermin tuberculosis derived proteins. Suitable 
adjuvants are commercially available as, for example, Freund's Incomplete Adjuvant 
and Complete Adjuvant (Difco Laboratories, Detroit, MI), Merck Adjuvant 65 (Merck 
and Company, Inc., Rahway, NJ), alum, biodegradable microspheres, monophosphoryl 
- lipid A and quil A. Cytokines, such as GM-CSF or interleukin-2, -7. or -12, may also 
10 be used as adjuvants. 

The above phamiaceutical compositions and vaccines may be used, for 
example, for the therapy of breast cancer in a patient. As used herein, a "patient" refers 
to any wami-blooded animal, preferably a human. A patient may or may not be 
afflicted, with breast cancer. Accordingly, the above pharmaceutical compositions and 
15 vaccines may be used to prevent the development of breast cancer or to treat a patient 
afflicted, with breast cancer. To prevent the development of breast cancer, a 
phannaceutical composition or vaccine comprising one or more polypeptides as 
described herein may be administered to 'a patient. Alternatively, naked DNA or 
plasmid or viral vector encoding the polypeptide may be administered. For treating a 

20 patient with breast cancer, the phannaceutical composition or vaccine may comprise 
one or more polypeptides, antibodies or nucleotide sequences complementary to DNA 
encoding a polypeptide as described herein {e.g., antisense RNA or antisense 
deoxyribonucleotide oligonucleotides). 

Routes and frequency of administration, as well as dosage, will vary 

25 from individual to individual. In general, the pharmaceutical compositions and 
vaccines may be administered by injection {e.g., intracutaneous, intramuscular, 
intravenous or subcutaneous), intranasally {e.g, by aspiration) Or orally. Between 1 and 
10 doses , may be administered for a 52-week period. Preferably, 6 doses are 
administered, at intervals of 1 month, and booster vaccinations may be given 

30 periodically thereafter. Alternate protocols may be appropriate for individual patients. 
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A suitable dose is an amount of a compound that, when administered as described 
above, is capable of promoting an anti-tumor immune response. Such response can be 
monitored by measuring the anti-tumor antibodies in a patient or by vaccine-dependent 
generation of cj^olytic effector cells capable of killing the patient's tumor cells in vitro. 
5 Such vaccines should also be capable of causing an immune response that leads to an 
improved clinical outcome (e.^., more frequent remissions, complete or partial or longer 
disease- free survival) in vaccmated patients as compared to non- vaccinated patients. In 
general, for pharmaceutical compositions and vaccines comprising one or more 
-polypeptides, the amount of each polypeptide present in a dose ranges from about 
10 100 iig to 5 mg. Suitable dose sizes will vary with the size of the patient, but will 
typically range from about 0.1 mL to about 5 mL. 

The following Examples are offered by way of illustration and not by 
way of limitation. 
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EXAMPLKS 
EXAMPLE 1 

Preparation of Breast Tumor-Specific cDNAs Using 
5 Differential Display RT-^PCR 

This Example illustrates the preparation of cDNA molecules encoding 
breast tunior-specific polypeptides using a differentia] display screen. 

A. Prcparaiion of RlSAgl cDNA and Charaeterization of niRhlA Expression 

10 Tissue samples were prepared from breast tumdi" and norma] tissue of a 

patient with breast cancer that was confirmed by pathology after removal from the 
patient. Normal RNA and tunwr'RNA was extracted from the samples and mRNA was 
isolated and convened into cDNA using a (dT)i3AG (SEQ ID NO.: 130) anchored 3' 
primer. Differential display PCR was then executed using a randomly chosen primer 

15 (CTfCAACCTC) (SEQ ID NO.:103). Amplification conditions were standard buffer 
containing 1 .5 mM MgCU, 20 pmol of primer, 500 pmol dNTP, and 1 unit of Taq DNA 
polymerase (Perkin-Elmer, Branchburg, NJ). Forty cycles of amplification were 
performed using 94*^C denaturation for 30 seconds, 42°C annealing for 1 minute, and 
72°C extension for 30 seconds. Ai RNA fingerprint containing 76 ampHfied products 

20 was obtained. Although the RNA fingerprint of breast tumor tissue was o\'er 98% 
identical to that of the normal breast tissue, a band was repeatedly observed to be 
specific to the RNA fingerprint pattern of the tumor. This band was cut out of a silver 
stained gel, subcloned into the T-vector (Novagen, Madison, WI) and sequenced. 

The sequence of the cDNA, referred to as BlSAgl, is provided in SEQ 

25 ID NOT. A database search of GENBANK and EMBL revealed that the B18Agl 
fragment initially cloned is 77% identical to the endogenous human retroviral clement 
S71, which is a truncated retroviral element homologous to the Simian Sarcoma Virus^ 
(SSV). S71 contains an incomplete g^ag^ gene, a portion of the pol gene and an LTR-like 
structure at the 3' terminus {see Werner et al.. Virology 7 7^:225-238 (1990)). BlSAgl 
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is also 64^0 identical to SSV in the region corresponding to the P30 (gag) locus. 
BlSAgl contains three separate and incomplete reading frames covering a region which 
shares considerable homology to a wide variety of gag proteins of retroviruses which 
infect mammals. In addition, the homology to S71 is not just within the gag gene, but 
5 spans several kb of sequence including an LTR. 

BlSAgl -specific PCR primers were synthesized using computer analysis 
guidelines. RT-PCR amplification (9rC, 30 seconds; 6()°C -> 42'='C, 30 seconds; 72°C, 
30 seconds for 40 cycles) confirmed that BlSAgl represents an actual niRNA sequence 
-present at relatively high levels in the patient's breast tumor tissue. The primers used 

10 in amplification were B18Agl-l (CTG CCT GAG CCA CAA ATG) (SEQ ID NO.:128) 
and BISAgl-4 (CCG GAG GAG GAA GCT AGA GGA ATA) (SEQ ID NO.T29) at a 
3.5 m.M magnesium concentration and a pH of 8.5, and B18Agl-2 (ATG GCT ATT 
TTC GGG GCC TGA CA) {SEQ ID NO.: 126) and B18Agl-3 (CCG GTA TCT CCT 
CGT GGG TAT T) (SEQ ID NO.:127) at 2 mM magnesium at pH 9.5. The same 

15 experiments showed exceedingly low to nonexistent levels of expression in this 
patient's normal breast tissue (see Figure 1). RT-PCR experiments were then used to 
show that BlSAgl mRNA. is present in nine other breast tumor samples (from Brazilian 
and American patients) but absent in, or at exceedingly low^ levels in, the normal breast 
tissue corresponding to each cancer patient. RT-PCR analysis has also shown that the 

20 BlSAgl transcript is not present in various normal tissues (including lymph node, 
myocardium and liver) and present at relatively low levels in PBMC and lung tissue. 
The presence of BlSAgl mRNA in breast tumor samples, and its absence from normal 
breast tissue, has been confirmed by Northern blot analysis, as shown in Figure 2. 

The differential expression of BlSAgl in breast tumor tissue was also 

25 confirmed by RNase protection assays. Figure 3 shows the level of BlSAgl mRNA in 
various tissue types as determined in four different RNase protection assays. Lanes 1- 
12 represent various normal breast tissue samples, lanes 13-25 represent various breast 
tumor samples; lanes 26-27 represent normal prostate samples; lanes 28-29 represent; 
prostate tumor samples: lanes 30-32 represent colon tumor samples; lane 33 represents 

30 normal aorta; lane 34 represents nonnal small intestine; lane 35 represents normal skin, 
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lane 36 represents normal lymph node; lane 37 represents nomial ovary; lane 38 
represents normal liver; lane 39 represents normal skeletal muscle; lane 40 represents a 
first normal stomach sample, lane 41 represents a second normal stomach sample; lane 
42 represents a normal lung; lane 43 represents normal kidney; and lane 44 represents 
5 normal pancreas, fnterexpenmental comparison was facilitated by mcluding a positive 
control RNA of known p-actin message abundance in each assay and normalizing the 
results of the different assays with respect to this positive control. 

RT-PCR and Southern Blot analysis has shown' the BlSAgl locus to be 
-present in human genomic UNA as a single copy endogenous retroviral element. A 

10 genomic clone of approximately 12-1 8' kb was isolated usiiig^ the initial BlSAgl 
sequence as a probe. Four additional subclones were also isolated by Xbal digestion. 
Additional retroviral sequences obtained from the ends of' the Xbal digests of these 
. clones (located as shown in Figure 4) are shown as SEQ ID NO:3 - SEQ ID NO:10, 
where SEQ ID NO:3 shows the location of the sequence labeled ^0 in Figure 4, SEQ ID 

15 N0:4 shows the location of the sequence labeled 11-29, SEQ ID NO:5 shows the 
location of the sequence labeled 3, SEQ ID NO:6 shows the location of the sequence 
labeled 6, SEQ ID NO:7 shows the location of the sequence labeled 12, SEQ ID NO:8 
shows the location of the sequence labeled 13, SEQ ID NO:9 shows the location of the 
sequence labeled 14 and SEQ ID NO: 10 shows the location of the sequence labeled 11- 

20 22. 

Subsequent studies demonstrated that the 12-18 kb genomic clone 
contains a retroviral element of about 7.75 kb, as shown in Figures 5 A and 5B. The 
sequence of this retroviral element is shown in SEQ ID NO: 141. The numbered line at 
the top of Figure 5A represents the sense strand sequence of the retroviral genomic 

25 - :clone. The box below this line shows the position of selected restriction sites. The 
arrows depict the different overlapping clones used to sequence the retroviral element. 
. The direction of the arrow shows whether the single-pass subclone sequence 
corresponded to the sense or anti-sense strand. Figure 5B is a schematic diagram of the 
retroviral element containing BlSAgl depicting the organization of viral genes within 

30 the element. The open boxes correspond to predicted reading frames, starting with a 
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methionine, found throughout the element. Each of the six hkely reading frames is 
shown, as indicated to the left of the boxes, with frames 1-3 corresponding to those 
found on the sense strand. 

Using the cDNA of SEQ ID NO:l as a probe, a longer cDNA was 
5 obtained (SEQ ID NO:227) which contains minor nucleotide differences (less than 1%) 
compared to the genomic sequence shown in SEQ ID NO: 141. 

B. Preparation of cDNA Molecules Encoding Other Breast Tumor-Specific 
Polypeptides 

Normal RNA and tumor RNA was prepared and mRNA was isolated and 

10 converted into cDNA using a (dT),.AG anchored 3' primer, as described above. 
Difterential display PGR was then executed using the randomly chosen primers SEQ 
ID NO.: 87-125. Amplification conditions were as noted above, and bands observed to 
be specific to the RN.^ fingerprint pattern of the tumor were cut out of a silver stained 
gel, subcloned into either the T-vcctor (Novagen, Madison, WI) or the pCRII vector 

15 (Invitrogen, San Diego, CA) and sequenced. The sequences are provided in SEQ ID 
NO:l i - SEQ ID NO:86. Of the 79 sequences isolated, 67 were found to be novel (SEQ 
ID NO.: 1 1-26 and 28-77) (see also Figures 6-20). 

An extended DNA sequence (SEQ ID NO: 290) for the antigen B15Agl 
(originally identified partial sequence provided in SEQ ID NO: 27) was obtained in 

20 further studies. Comparison of the sequence of SEQ ID NO: 290 with those in the gene 
bank as described above, revealed homology to the known human P-A activin gene. 

Subsequent studies identified an additional 146 sequences (SEQ ID 
NOS:I42-289), of which 115 appeared to be novel (SEQ ID NOS:I42, 143, 146-152, 
154-166, 168-176, 178-192, 194-198, 200-204, 206, 207, 209-214, 216, 218, 219, 221- 

25 240, 243-245, 247, 250, 251, 253, 255, 257-266, 268, 269, 271-273, 275, 276, 278, 280, 
281, 284, 288 and 291). To the best of the inventors' knowledge none of the previously 
identified sequences have heretofore been shown to be expressed at a greater level in 
human breast tumor tissue than in normal breast tissue. 

In further studies, six different splice forms of the antigen BllAgl were 

30 isolated, with each of the various splice forms containing slightly different versions of 
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the Bl lAgl coding frame. Splice junction sequences define individual exons which, in 
various patterns and arrangements, make up the various splice forms. Primers were 
designed to examme the expression pattern of each of the exons using RT-PCR as 
described below. Each exon was found to show the same expression pattern as the 
5 original Bl lAgl clone, with expression being breast tumor, prostate and testis-specific. 
The determined cDNA sequences for the isolated protein coding exons are provided in 
SEQ ID NO: 292-297, respectively. 

EXAMPLE 2 

10 Prepara i jon of B18AG1 DNA^from Human Gf.NOMic DNA 

This Example illjustrates the preparation of BlSAgl DKA by 
amplification from human genomic DNA. - 

BlSAgl DNA may be prepared from 2,50, ng human genomic DNA 
15 usmg 20 pmol of BlSAgl specific pnmcrs, 500 pmol dNTPS and I unit of Taq DNA 
polymerase (Pcrkin Elmer, Branchburg, NJ) using the following amplification 
parameters: 94''C for 30 seconds denaturing, 30 seconds 60''Q to 42°C touchdown 
annealing in 2°C increments every two cycles and ll^'C extension for 30 seconds. The 
last increment (a AT'C annealing temperature) should cycle 25 times. Primers were 
20 selected using computer analysis. Primers synthesized were B18Agl-l, B18Agl-2, 
B18Agl-3, and B18Agl-4. Primer pairs that may be used are 1+3, 1+4, 2+3, and 2+4. 

Following gel electrophoresis, the band corresponding to BlSAgl DNA 
may be excised and cloned into a suitable vector. 

25 EXAMPLE 3 

Preparation of BISAgI DNA from Breast Tumor cDNA 

This Example illustrates the preparation of BlSAgl DNA by 
amplification from human breast tumor cDNA. 
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First strand cDNA is synthesized from RNA prepared from human breast 
tumor tissue in a reaction mixture containing 500 ng poly A^- RNA, 200 pmol of the 
primer (T),,AG (/.e., TTT TTT TTT TTT AG) (SEQ ID NO: 130), IX first strand 
reverse transcriptase buffer, 6,7 mM DTT, 500 mmol dNTPs, and 1 unit AMV or 
5 MMLV reverse transcriptase (from any suppher, such as Gibco-BRL (Grand Island, 
NTY)) in a fmal volume of 30 After first strand synthesis, the cDNA is diluted 
approximately 25 fold and I \i\ is used for amplification as described in Example 2. 
While some primer pairs can result in a heterogeneous population of transcripts, the 
primers B18Agl.2 (5^ATG GCT ATT TTC GGG GGC TGA CA) (SEQ ID NO: 126) 
10 and B18Agl-3 (5H?CG GTA TCT CCT CGT GGG TAT T) (SEQ ID NO: 127) yield a 
single 151 bp amplification product. 

EXAMPLE 4 

Identification of B-cf.ll and T-clll Epitopes of BISAgI 

15 

This Example illustrates the identification of BlSAgl epitopes. 
The BlSAgl sequence can be screened using a variety of computer 
algorithms. To determine B-cell epitopes, the sequence can be screened for 
hydrophobicity and hydrophilicity values using the method of Hopp, Prog. Clin. Biol. 

20 Res. 77^5:367-77 (1985) or, alternatively, Cease ct aL, J. Exp. Med 76^:1779-84 
(1986) or Spouge et aL, J. Immunol. 7i5:204-12 (1987). Additional Class II MHC 
(antibody or B-cell) epitopes can be predicted using programs such as AMPHI {e.g., 
Margalit et aL, J. ImmunoL 138:2213 (1987)) or the methods of Rothbard and Taylor 
{e.g.^EMBOJ. 7:93 (1988)). 

25 Once peptides (15-20 amino acids long) are identified using these 

techniques, individual peptides can be synthesized using automated peptide synthesis 
equipment (available from manufacturers such as Perkin Elmer/ Applied Biosystems 
Division, Foster Cit>, CA) and techniques such as Merrifield synthesis. Following,^ 
synthesis, the peptides can used to screen sera harvested from either normal or breast 

30 cancer patients to detemrnne whether patients with breast cancer possess antibodies 
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reactive with the peptides. Presence of such ant.bod.es >n breast cancer patient would 
confirm the nxmninogen.cuy of the specific B-cell epitope in question. The peptides 
can also be tested for their ab.l.ty to generate a serologic or humoral immune in animals 
(mice, rats, rabbits, chimps etc.) following immunization in vivo. Generation of a 
5 peptide-specific antiserum following such immunization further confirms the 
immunogenicity of the specific B-cell epitope in question. 

To identify T-cell epitopes, the BlSAgl sequence can be screened using 
different computer algorithms which are useful in identifying 8-10 ammo acid motifs 
within the BlSAgJ. sequence which are capable of- binding to IILA Class I MHC 
10 molecules, (sec, e.g., Rammensee ct al., /mmmwgeimics 4n]78-228 (1995)). 
Following synthesis such peptides can be tested for their ability to bind to class I MHC 
using standard binding assays (e.g., Sette et al., J. Immunol. /.>-i:5586-92 (1994)) and 
more importantly can be tested for their ability to generate antigen reactive cytotoxic T- 
cells following in vitro stimulation of patient or normal peripheral mononuclear cells 
15 using, for example, the methods of Bakker et al.. Cancer Res. .55:5330-34 (1995); 
Vissercn ^et :al.. J. Immunol. 154:3991-98 (1995); Kawakami et al.. ./ Immunol. 
154:3961^8 (1995); and Kast et al., J. Immunol. /52:3904-12 (1994). Successful 
/« virro generation of T-cells capable of killing autologous (bearing the same Class I 
MHC molecules) tumor cells following in.vitro peptide stimulation further confirms the 
20 immunogenicity of the BlSAgl antigen. Furthermore, such peptides may be used to 
generate murine peptide and BlSAgl reactive cytotoxic T-cells following in vivo 
immunization in mice rendered transgenic for expression of a particular human MHC 
Class I haplotype (Vitiello et al., J. Exp. Med 1 73: 1 007- 1 5 ( 1 99 1 ). 

A representative list of predicted BlSAgl B-cell and T-cell epitopes, 
25 broken down according to predicted HLA Class I MHC binding antigen, is shown 



below: 



Predicted Th Motife p p-t^p^^) (SPO TP MOg.: m U3) 

SSGGRTFDDFHRYLLVGI : 
3^ > QGAAQKPHSfLSKXIEVVQGHDE 
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SPGVFLEHLQEAYRIYTPFDLSA 

Predicted HLA A2.1 Motifs (T-cell epitopes) (SEQ ID NOS.: 134-140) 
YLLVGIQGA 
5 GAAQKPINL 
NLSKXIEVV 
EVVQGHDES 
HLQEAYRIY 
NLAFVAQyVA 
10 FVAQAAPDS 

EXAMPLE 5 

Characterization of Breast Tumor Genes Discovered by 

DlFFEREN'MAL DISPLAY PGR 

15 

The specificity and sensitivity of the breast tumor genes discovered by 
differential display PGR were determined using RT-PCR. This procedure enabled the 
rapid evaluation of breast tumor gene mRNA expression semiquantitatively without 
using large amounts of RNA. Using gene specific primers, mRNA expression levels in 

20 a variety of tissues were examined, mcludmg 8 breast tumors, 5 normal breasts, 2 
prostate tumors, 2 colon tumors, 1 lung tumor, and 14 othernormal adult human tissues, 
including normal prostate, colon, kidney, liver, lung, ovary, pancreas, skeletal muscle, 
skin, stomach and testes. 

To ensure the semiquantitative nature of the RT-PCR, P-actin was used 

25 as internal control for each of the tissues examined. Serial dilutions of the first strand 
cDNAs were prepared and RT-PCR assays performed using p-actin specific primers. A 
dilution was then selected that enabled the linear range amplification of p-actin 
template, and which was sensitive enough to reflect the difference in the initial copy,^ 
number. Using this condition, the P-actin levels were determined for each reverse 

30 transcription reaction from each tissue. DNA contamination was minimized by DNase 
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treatment and by assuring a negative result when using first strand cDNA that was 
prepared without adding reverse transcriptase. 

Using gene specific primers, the mRNA expression levels were 
determined in a variety of tissues. To date, 38 genes have been successfi.illy examined 
5 by RT-PCR, five of which exhibit good specificity and sensitivity for breast tumors 
(B15AG-1, B31GAlb, B38GA2a, BllAla and BlSAGla). Figures 21 A and 2 IB 
depict the results for three of these genes: B15AG-1 (SEQ ID NO:27), B31GAlb (SEQ 
ID NO: 148) and B38GA2a (SEQ ID NO. 157). Table 1 summarizes the expression 
level of all the genes tested in normal breast tissue and breast tumors, and also in other 
10 tissues. 

TABLE I 

Percentage of Breast Cancer Antigens that are Expressed in Various Tissues 



15 



Breast Tissues 



Over-expressed in Breast Tumors 
Equally Expressed in Normals and Tumor 



84% 



20 



Over-expressed in Breast Tumors but 
not in any Norma] Tissues 



9^ 



25 



Other Tissues 



Over-expressed in Breast Tumors but 
Expressed in Some Normal Tissues 

Over-expressed in Breast Tumors but 
Equally Expressed in All Other Tissues 



30*>o 



61% 



30 From the foregoing, it will be appreciated that, although specific 

embodiments of the invention have been described herein for the purpose of illustration, 
various modifications may be made without deviating from the spirit, and scope of the 
invention. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANTS: Frudakis, Tony N. 

Smith, John M. 
Reed, Steven G. 

(ii) TITLE OF INVENTION: COMPOSITIONS AND METHODS FOR THE 
TREATMENT AND DIAGNOSIS OF BREAST CANCER 

(iii) NUMBER OF SEQUENCES: 297 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SEED and BERRY LLP 

(B) STREET: 6300 Columbia Center, 701 Fifth Avenue 

(C) CITY: Seattle 

(D) STATE: Washington 

(E) COUNTRY: USA 

(F) ZIP: 98104-7092 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER; IBM PC compatible 

(C) OPERATING SYSTEM: PC- DOS/MS -DOS 

(D) SOFTWARE: Patent In Release #1,0, Version #1.30 

(vi) CURRENT APPLICATION DATA; 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 04 -APR- 1997 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Maki , David J. 

(B) REGISTRATION NUMBER: 31,3 92 

(C) REFERENCE /DOCKET NUMBER: 210121.419C2 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (206) 622-4900 

(B) TELEFAX: (206) 682-6031 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1 . .363 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

TTA GAG ACC CAA TTG GGA CCT AAT TGG GAC CCA AAT TTC TCA AGT GGA 4 8 

Leu Glu Thr Gin Leu Gly Pro Asn Trp Asp Pro Asn Phe Ser Ser Giy 
15 10 15 

GGG AGA ACT TTT GAC GAT TTC CAC CGG TAT CTC CTC GTG GGT ATT CAG 96 
Gly Arg Thr Phe Asp Asp Phe His Arg Tyr Leu Leu Val Gly lie Gin 
20 25 30 

GGA GCT GCC CAG AAA CCT ATA AAC TTG TCT AAG GCG ATT GAA GTC GTC 14 4 

Gly Ala Ala Gin Lys Pro lie Asn Leu Ser Lys Ala lie Glu Val Val 
35 40 45 

CAG GGG CAT GAT GAG TCA CCA GGA GTG TTT TTA GAG CAC CTC CAG GAG 192 
Gin Gly His Asp Glu Ser Pro Gly Val Phe Leu Glu His Leu Gin Glu 
'^O 55 60 

GCT TAT CGG ATT TAC ACC CCT TTT GAC CTG GCA GCC CCC GAA AAT AGC 24 0 

Ala Tyr Arg He Tyr Thr Pro Phe Asp Leu Ala Ala Pro Glu Asn Ser 
65 70 75 80 

CAT GCT CTT AAT TTG GCA TTT GTG GCT CAG GCA GCC CCA GAT AGT AAA 2 88 

His Ala Leu Asn Leu Ala Phe Val Ala Gin Ala Ala Pro Asp Ser Lys 
85 90 95 

AGG AAA CTC CAA AAA CTA GAG GGA TTT TGC TGG AAT GAA TAC CAG TCA 3 36 

Arg Lys Leu Gin Lys Leu Glu Gly Phe Cys Trp Asn Glu Tyr Gin Ser 
100 105 110 

GCT TTT AGA GAT AGC CTA AAA GGT TTT 3 63 

Ala Phe Arg Asp Ser Leu Lys Gly Phe 
115 120 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOr.ECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Leu Glu Thr Gin Leu Gly Pro Asn Trp Asp Pro Asn Phe Ser Ser Gly 

1 5 10 15 

Gly Arg Thr Phe Asp Asp Phe His Arg Tyr Leu Leu Val Gly He Gin 

20 25 30 

Gly Ala Ala Gin Lys Pro lie Asn Leu Ser Lys Ala lie Glu Val Val 

35 40 45 ' 

Gin Gly His Asp Glu Ser Pro Gly Val Phe Leu Glu His Leu Gin Glu 

50 55 60 

Ala Tyr Arg lie Tyr Thr Pro Phe Asp Leu Ala Ala Pro Glu Asn Ser 
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65 70 75 80 

His Ala Leu Ann Leu Ala Phe Val Ala Gin Ala Ala Pro Asp Ser Lys 

85 90 95 

Arg Lys L-^u Gin Lys Leu GIu Gly Phe Cys Trp Asn Glu Tyr Gin Ser 

100 105 110 

Ala Phe Arg Asp Ser Leu Lys Gly Phe 
115 120 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO • 3 : 



TCTTAGAATC TTCATACCCC GAACTCTTGG GAAAACTTTA ATCAGTCACC TACAGTCTAC 60 

CACCCATTTA GGAGGAGCAA AGCTACCTCA GCTCCTCCGG AGCCGTTTTA AGATCCCCCA 12 0 

TCTTCAAAGC CTAACAGATC AAGCAGCTCT CCGGTGCACA ACCTGCGCCC AGGTAAATGC 180 

CAAAAAAGGT CCTAAACCCA GCCCAGGCCA CCGI'CTCCAA GAAAACTCAC CAGGAGAAAA 24 0 

GTGGGAAATT GACTTTACAG AAGTAAAACC ACACCGGGCT GGGTACAAAT ACCTTCTAGT 3 00 

ACTGGTACAC ACCTTCTCTG GATGGACTGA AGCATTTGCT ACC.AAAAACG AAACTGTCAA 3 60 

TATGGTAGTT AAGTTTTTAC TCAATGAAAT CATCCCTCGA CGTGGGCTGC CTGTTGCCAT 4 20 

AGGGTCTGAT AATGGAJ\CGG CCTTCGCCTT GTCTATAGTT TAATCAGTCA GTAAGGCGTT 4 80 

AAACATTCAA TGGAAGCTCC ATTGTGCCTA TCGACCCAGA GCTCTGGGCA AGTAGA-ACGC 54 0 

ATGAACTGCA CCCTAAAPJ-Jk ACACTCTTAC I\f<JKA'rTAATC TTAAAAACCG GTGTTAA.TTG 6 00 

TGTTAGl'CTC CTTCCCTTAG CCCTACTTAG AGTTAAGGTG CACCCCTTAC TGGGCTGGGT 660 

TCTTTACCTT TTGAAATCAT NTTTNGGAAG GGGCTGCCTA TCTTTNCTTA ACTAAAAAAN 72 0 

GCCCATTTGG CAAAAATTTC NCAACTAATT TNTACGTNCC TACGTCTCCC CAACAGGTAN 7 80 

AAAiXATCTNC TGCCCTTTTC AAGGAACCAT CCCATCCATT CCTNAACAAA AGGCCTGCCN 84 0 

TTCTTCCCCC AGTTA.ACTNT TTTTTNTTAA AATl CCCAAA AAANGAACCN CCTGCTGGAA 90 0 

AAACNCCCCC CTCCAA1>JCCC CGGCCNAAGN GGAAGGTTCC CTTGAATCCC NCCCCCNCNA 96 0 

ANGGCCCGGA ACCNTTAAAN TNGTTCCNGG GGGTNNGGCC TAAAAGNCCN ATTTGGTAAA 10 2 0 

CCTANAAAIT TTTTCTTTTN TAAAAACCAC NWTTTNNTTT TTCTTAAACA AAACCCTNTT 10 8 0 

TNTAGNANCN TATTTCCCNC C 1101 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHAF-ACTERISTICS : 

(A) LENGTH: 1087 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID'N0:4: 

TCTAGAGCTG CGCCTGGATC CCGCCACAGT GAGGAGACCT GAAGACCAGA GAAAACACAG 60 

CAAGTAGGCC CTTTAAACTA CTCACCTGTG TTGTCTTCTA ATTTATTCTG TTTTATTTTG 12 0 

TTTCCATCAT TTTAAGGGGT TAAAATCATC TTGTTCAGAC CTCAGCATAT AAAATGACCC 18 0 

ATCTGTAGAC CTCAGGCTCC AACCATACCC CAAGAGTTGT CTGGTTTTGT TTAAATTACT 24 0 

GCCAGGTTTC AGCTGCAGAT ATCCCTGGAA GGAATATTCC AGATTCCCTG AGTAGTTTCC 300 
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AGGTTAAAAT CCTATAGGCT TCTTGTGTTT TGAGGAAGAG TTCCTGTCAG AGAAAAACAT 360 

GATTTTGGAT TTTTAACTTT AATGCTTGTG AAACGCTATA AAAAAAATTT TCTACGCCTA 4 20 

GGTTTAAAGT ACTGTTAGTG AGAAATTAAA ATTCCTTCAG GAGGATTAAA CTGCCATTTC 4 80 

AGTTACCCTA ATTCCAAATG TTTTGGTGGT TAGAATCTTC TTTAATGTTC TTGAAGAAGT 54 0 

GTTTTATATT TTCCCATCNA GATAAATTCT CTCNCNCGTT NNTTTTNTNT CTNNTTTTTT 6 00 

AAAACGGANT CTTGCTCCGT TGTGGANGCT GGGAATTTTN TTTTGGCCAA TCTCGGGTNC 660 

CTTGCAANAA TNCTGCNTCC CAAAATTACC NCCTTTTTCC CAGCTGCAGC CCNNGGAATT 72 0 

ACCTGGAATT AIJAGGCCCCC UCCCCCCCCC CGGCTAATTT GTTTTTGTTT TTAGTAAAAA 7 80 

ACGGGTTTCC TGTTTTAGTT AGGATGGCCC ANNTGTGAGC CCNTNATCNT CCCCCTCNGC 84 0 

CCTCNAATNT TNGGN^JTANG GCTTACCCCC CCCNGNNGTT TTTCCTCCAT TNAAATTTTC 90 0 

TNTGGANTCT TOAATl^^CGG GTTTTCCCTT TTAAACCNAT TTTTTTTTTN NNNCCCCCAM 96 0 

TTTTNCCTCC CCCNTNTNTA ANGGGGGTTT CCCAANCCGG GTCCNCGCCG ANGTCCCCAA 1020 

TTTTTCTCGC CCCCCCTCTT TTTTCTTTNC CCCAAAANTC CTATCTTTTC CTNNAAATAT 10 80 

CNANTNT 10 87 

(2) IMFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1010 base pairs 

(B) TYPE: nucleic acid i . • 

(C) STRANDEDNESS single 

(D) TOPOLOGY: linear' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



TCTAGACCAA 


GAAATGGGAG 


GATTTTAGAG 


TGACTGATGA 


'^TTTCTCTATC 


ATCTGCAGTT 


60 


AGTAAACATT 


CTCCACAGTT 


TATGCAAAAA 


GTAACAAAAC 


CACTGCAGAT 


GACAAACACT 


120 


AGGTAACACA 


CATACTATCT 


CCCAAATACC 


TACCCACAAG 


CTCAACAATT 


TTAAACTGTT 


180 


AGGATCACTG. 


GCTCTAATCA 


CCATGACATG 


AGGTCACCAC 


CAAACCATCA 


AGCGCTAAAC 


240 


AGACAGAATG 


TTTCCACTCC 


TGATCCACTG 


TGTGGGAAGA 


AGCACCGAAC 


TTACCCACTG 


300 


GGGGGCCTGC 


NTCANAANAA 


AAGCCCATGC 


CCCCGGGTNT 


NCCTTTNAAC 


CGGAACGAAT 


360 


NAACCCACCA 


TCCCCACANC 


TCCTCTGTTC 


NTGGGCCCTG 


CATCTTGTGG 


CCTCNTNTNC 


420 


TTTNGGGGAN 


ACNTGGGGAA 


GGTACCCCAT 


TTCNTTGACC 


CCNCNANAAA 


ACCCCNGTGG 


480 


CCCTTTGCCC 


TGATTCNCNT 


GGGCCTTTTC 


TCTTTTCCCT 


TTTGGGTTGT 


TTAAATTCCC 


54 0 


AATGTCCCCN 


GAACCCTCTC 


CNTNCTGCCC 


AAAACCTACC 


TAAATTNCTC 


NCTANGNNTT 


600 


TTCTTGGTGT 


TNCTTTTCAA 


AGGTNACCTT 


NCCTGTTCAN 


NCCCNACNAA 


AATTTNTTCC 


660 


NTATNNTGGN 


CCCNNAAAAA 


NNNATCNNCC 


CNAATTGCCC 


GAATTGGTTN 


GGTTTTTCCT 


720 


NCTGGGGGAA 


ACCCTTTAAA 


TTTCCCCCTT 


GGCCGGCCCC 


CCTTTTTTCC 


CCCCTTTNGA 


780 


AGGCAGGNGG 


TTCTTCCCGA 


ACTTCCAATT 


NCAACAGCCN 


TGCCCATTGN 


TGAAACCCTT 


840 


TTCCTAAAAT 


TAAAAAATAN 


CCGGTTNNGG 


NNGGCCTCTT 


TCCCCTCCNG 


GNGGGNNGNG 


900 


AAANTCCTTA 


CCCCNAAAAA 


GGTTGCTTAG 


CCCCCNGTCC 


CCACTCCCCC 


NGGAAAAATN 


960 


AACCTTTTCN 


AAAAAAGGAA 


TATAANTTTN 


CCACTCCTTN 


GTTCTCTTCC 




1010 



(2) INFORMATION FOR SEQ ID NO : 6 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 950 base pairs 

(B) TYPE: nucleic acid- 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
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TCTAGAGCTC GCGGCCGCGA GCTCTAATAC GACTCACTAT AGGGCGTCGA CTCGATGTCA 6 0 

GCTCACTGCA ATCTCTGCCC CCGGGGTGAT GCGATTCTCC TGCCTCAGCC TTCCAAGTAG 120 

CTGGGATTAC AGGCGTGCAA GACCACACCC GGCTAATTTT GTATTTTTAA TAGAGATGGG ISO 

GTTTTCCGTT GTTGGCCAIJN ATGGTCTCNA ACCCCTGACC TCNNGTGATC CCCCCNCCGN 240 

NGANCTCIJNA CTGCTGGGGA TNNCCGNNNN NNNCCTCCCN NCNCNNNNNN NCNGMNTCCN 3 00 

TNNTCCTTNC TCNIJIINN^JNN CNNTCmTCC NNCTTCTCNC CNNNTNTTNT CNNCNNCCNN 3 60 

CNNNCCNCNT NCCCNCm<^JT TCNCNTNCNN TNTCCNNCNN NNTCNNCNNN CNNNNCNTNN 4 20 

CCNNTACIJTC NTNNIJGMNNT CCNTCTNTIW CCTCNNCNNT CNCTNCKCNT TNTCTCGTCN 4 80 

NTNNNNl^n^CT GCN^IlJmJTCT CNTCNCNNCN TNCCTCNNTN NCCNCNCCCC NCCTCNCNNC 54 0 

CTNNTTTIJNN CNNCMNNTCC NTNCCNTTCN NNTCCNNTNN CNNCNTCNCN NNCNTTNTTC 6 00 

CCNCCNITTTC CTTNCMCNTN NNNTNTCNOTJ CNCNTCIWTC NTTTNCTCCT NNNTCCCNNC 6 60 

TCN14TTCNCC CNNIJTCCNCC CCCCNCCTNT CTCTCNCCCN NNTNNTIJTNTN NNNCNTCCNC 72 0 

TNTCNCIJTTC NTCNIITNCNT TNCTNTCNNC NNCNl^TNCNC TNCCNTNTNT CTNNNTCNCN 7 80 

TCNCNTIJTCN CCNTCCNTTN CTNTCTCCTN TNTCCTTCCC CTCNCCTMCT CNTTCNCCNG 84 0 

CCNNTNTNTN TNNCMCCNNT NCTITNNCNlyC CNTCNTTTCN TCTCTNCTNK NNNTNNCCTC 90 0 

NNCCCNTNCC CTNNTNCNCT NCTNNTACCN TNCTNCTCCM TCTTCCTTCC 9 50 



(2) INFORMATION FOR SEC ID NO : 7 : 



(-) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 1086 base pairs 
(E) TYPE: nucleic acid 

(C) STP^JDEDNESS : single 

(D) TOPOLOGY: linear 



(Xl) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 



TCTAGAGCTC GCGGCCGCGA GCTCAATTAA CCCTCACTAA AGGGAGTCGA CTCGATCAGA bO 

CTGTTACTGT GTCTATGTAG AAAGAAGTAG ACATAAGAGA TTCCATTTTG TTCTGTACTA 12 0 

AGAAAAATTC TTCTGCCTTG AGATGCTGTT AATCTGTAAC CCTAGCCCCA ACCCTGTGCT 18 0 

CACAGAGACA TGTGCTGTGT TGACTCAAGG TTCAATGGAT TTAGGGCTAT GCTTTGTTAA 24 0 

AAAAGTGCTT GAAGATAATA TGCTTGTTAA AAGTCATCAC CATTCTCTAA TCTCAAGTAC 3 00 

CCAGGGACAC AATACACTGC GGAAGGCCGC AGGGACCTCT GTCTAGGAAA GCCAGGTATT 360 

GTCCAAGATT TCTCCCCATG TGATAGCCTG AGATATGGCC TCATGGGAAG GGTAAGACCT 42 0 

GACTGTCCCC CAGCCCGACA TCCCCCAGCC CGACATCCCC CAGCCCGACA CCCGAAAAGG 4 80 

GTCTGTGCTG AGGAAGATTA NTAAAAGAGG AAGGCTCTTT GCATTGAAGT AAGAAGAAGG 54 0 

CTCTGTCTCC TGCTCGTCCC TGGGCAATAA AATCTCTIGG TGTTAAACCC GAATGTATGT 60 0 

TCTACTTACT GAGAATAGGA GAAAACATCC TTAGGGCTGG AGGTGAGACA CCCTGGCGGC 66 0 

ATACTGCTCT TTAATGCACG AGATGTTTGT KTAATTGCCA TCCAGGGCCA NCCCCTTTCC 72 0 

TTAACTTTTT ATGANACAAA AACTTTGTTC NCTTTTCCTG CGAACCTCTC CCCCTATTAN 78 0 

CCTATTGGCC TGCCCATCCC CTCCCCAAAN GGTGAAAANA TGTTCNT7\AA TNCGAGGGAA 84 0 

TCCAAAA.CNT TTTCCCGTTG GTCCCCTTTC CAACCCCGTC CCTGGGCCNN TTTCCTCCCC 90 0 

AACNTGTCCC GGNTCCTTCN TTCCCNCCCC CTTCCCNGAN AAAAAACCCC GTNTGANGGN 96 0 

GCCCCCTCAA ATTATAACCT TTCCNAAACA AANNGGTTCN AAGGTGGTTT GNTTCCGGTG 102 0 

CGGCTGGCCT TGAGGTCCCC CCTNCACCCC AATTTGGAAN CCNGTTTTTT TTATTGCCCN 1080 

NTCCCC 1086 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1177 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(XI) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

NCCNTTTAGA TGTTGACAAN NTAAACAAGC NGCTCAGGCA GCTGAAAAAA GCCACTGATA 
AAGCATCCTG GAGTATCAGA GTTTACTGTT AGATCAGCCT CATTTGACTT CCCCTCCCAC 12 0 

ATCGTGTTTA AATCCAGCTA CACTACTTCC TGACTCAAAC TCCACTATTC CTGTTCATgJ 
CTGTCAGGAA CTGTTGGAAA CTACTGAAAC TGGCCGACCt GATCTTCAAA ATGTGCCCCT 
AGGAAAGGTG GATGCCACCG TGTTCACAGA CAGTACCNCC TTCCTCGAGA AGGGACTACG 
AGGGGCCGGT GCANCTGTTA CCAAGGAGAC TNATGTGTTG TGGGCTCAGG CTTTACCANC 
AAACACCiCA NCNCNNAAGG CTGAATTGAT CGCCCTCACT CAGGCTCTCG GATGGGGTAA 
GGGATATTAA ccttaacact gacagcaggt- ACGCCTTTGC TACTGTGCAT GTACGTGGAG 
SJ™^?? f AGCGTGGG CTACTCACTC GGCAGGTGGC TGTNATCCAC TGTAAANgSa 
CATCAAAAGG AAAACNNGGC TGTTGCCCOT GGTAJ^CCANA AANCTGATCN NCAGCTCNAA 
GATGCTGTGT TGACTTTCAC TCNCNCCTCT rAAAtTTGCT GCCCACANTC TCCTTTCCCA 660 
ACCAGATCTG CCTGACAATC GCCATACTCA - AAAA^A.^^ a^;,CTGGCC CCGAACCCNA 72 0 

ACCAATAAAA ACGGGGANGG TNGGTNG.VJC NNCCTGACCC AAAAATAATG GATCCCCCGG 780 
GCTGCAGGAA TTCAATTCAN CCTTATCNAT ACCCCCAACN NGGNGGGGGG GGCCNCTNCC 
CATTNCCCCT NTATTNATTC TTTNNCCCCC CCCCCGGCNT CCTTTTTNAA CTCGTGAAAG 
NCTTACCAAN TTATGNCCTG GACCNTCCCC TTCCNCGGTN ONTTANAAAA 
AAAAGCCCNC ANTCCCNTCC NAAATTTGCA CNGAAAGGNA AGGAATTTAA" CCTTTATTTT 
TTNNTCCTTT ANTTTGTNNN CCCCCTTTTA CCCAGGCGAA CNGCCATCNT TTAANAAAAA 
AAANAGAANG TTTATTTTTC CTTNGAACCA TCCCAATATJA AANCACCCGC NGGGGAACGG 
GGNGGNAGGC CNCTCACCCC CTTTNTGTNG GNGGGNG ■ ^^^.AACGG 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

- (A) X.ENGTH: 114 6 base- pairs 
(B) TYPE: nucleic acid - 
(C.) STRANDEDNESS : single ' 
(D) TOPOLOGY: linear ■ 



(xi> SEQUENCE DESCRIPTION SEQ ID NO : 9 : 



NCCNNTTNNT. GATGTTGTCT TTTTGGCCTC TCTTTGGATA- CTTTCCCTCT CTTCAGAGGT 
GAAAAGGGTC.AAAAGGAGCT GTTGACAGTC ATCCCAGGTG GGCCAATGTG TGCAGAGTAC 
AGACTCCATC AGTGAGGTCA AAGCCTGGGG CTTTTCAGAG AAGGGAGGAT TATGGGTTTT 
CCAATTATAC AAGTCAGAAG- TAGAAAGAAG GGACATAAAC CAGGAAGGGG GTGGAGCACT 
CATCACCCAG AGGGACTTGT GCCTCTCTCA GTGGTAGTAG AGGGGCTACT TCCTCCCACC 
ACGGTTGCAA CCAAGAGGCA ATGGGTGATG AGCCTACAGG GGACATATJCC GAGGAGACAT 
GGGATG-ACCC TAAGGGAGTA GOCTGGTTTT AAGGCGGTGG GACTGGGTGA GGGAAACTCT 
CCTCTTCTTC AGAGAGAAGC AGTACAGGGC GAGCTGAACC GGCTGAAGGT CGAGGCGAAA 
ACACGGTCTG GCTCAGGAAG ACCTTGGAAG TAAAATTATG AATGGTGCAT GAATGGAGCC 
ATGGAAGGGG TGCTCCTGAC CAAACTCAGC CATTGATCAA TGTTAGGGAA ACTGATCAGG 
GAAGCCGGGA ATTTCATTAA CAACCCGCCA CACAGCTTGA ACATTGTGAG GTTCAGTGAC 
CCTTCAAGGG GCCACTCCAC TCCAACTTTG GCCATTCTAC TTTGCNAAAT TTCCAAAACT 
TCCTTTTTTA AGGCCGAATC CNTANTCCCT NAAAAACNAA AAAAAATCTG CNCCTATTCT 
GGAAAAGGCC CANCCCTTAC CAGGCTGGAA GAAATTTTNC CTTTTTTTTT TTTTTGAAGG 
CNTTTNTTAA ATTGAACCTN AATTCNCCCC CCCAAAAAAA AACCCNCCNG GGGGGCGGAT 
TTCCAAAAAC NAATTCCCTT ACCAAAAAAC AAAAACCCNC CCTTNTTCCC TTCCNCCCTN 
TTCTTTTAAT TAGGGAGAGA TNAAGCCCCC CAATTTCCNG GNCTNGATNN GTTTCCCCCC 
CCCCCATTTT CCNAAACTTT TTCCCANCNA GGAANCCNCC CTTTTTTTNG GTCNGATTNA 
NCAACCTTCC AAACCATTTT TCCNNAAAAA NTTTGNTNGG -NGGGAAAAAN ACCTNNTTTT 



60 



180 
240 
300 
360 
420 
480 
540 
600 



840 

900 
960 
1020 
1080 
1140 
1177 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
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ATAG Aia 114 6 

(2) INFORMATION FOR SEQ ID NO : 1 0 : 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 545 base pairs 

(B) TYPE: nucleic acid 

(C) STRAI^JDEDNESS : single 

(D) TOPOLOGY: linear 



(Xi> SEQUENCE DESCRIPTION: SEQ ID NO : 1 0 : 



CTTCATTGGG TACGGGCCCC CTCGAGGTCG ACGGTATCGA TAAGCTTGAT ATCGAATTCC 6 0 

TGCAGCCCGG GGGATCCACT AGTTCTAGAG TCAGGAAGAA CCACCAACCT TCCTGATTTT 12 0 

TATTGGCTCT GAGTTCTGAG GCCAGTTTTC TTCTTCTGTT GAGTATGCGG GATTGTCAGG 180 

CAGATCTGGC TGTGGAAAGG AGACTGTGGG CAGCAAGTTT AGAGGCGTGA CTGAAAGTCA 24 0 

CACTGCATCT TGAGCTGCTG AATCAGCTTT CTGGTTACCA CGGGCAACAG C^rGTGTTTTC 3 00 

CTTTTGATGT CCTTTACAGT GGATTACAGC CACCTGCTGA GGTGAGTAGC CCACGCTCCT 3 60 

GGTAGATGGC TCCACGTACA TGCACAGTAG CA/vAGGCGTA CCTGCTGTCA GTGTTAACGT 42 0 

TAATATCCTT ACCCCATCGG AGAGCCTGAG TGAGGGCGAT CAATTCAGCC CTTTTGTGCT 4 80 

GAGGTGTTTG CTGGTTAAGC CCTGAACCCA CAACACATCT GTCTCCATGG TAACAGCTGC 54 0 

ACCGG 54 5 



(2) INFORMATION FOR SEQ ID NO : I 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 196 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



TCTCCTAGGC TGGGCACAGT GGCTCATACC TGTAATCCTG ACCGTTTCAG AGGCTCAGGT 60 

GGGGGGATCG CTTGAGCCCA AGATTTCAAG ACTAGTCTGG GTAACATAGT GAGACCCTAT 12 0 

CTCTACGAAA AAATAAA/\AA ATGAGCCTGG TGTAGTGGCA CACACCAGCT GAGGAGGGAG 180 

AATCGAGCCT AGGAGA 196 



(2) INFORMATION FOR SEQ ID NO : 12 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



TCTCCTAGGC TTGGGGGCTC TGACTAGAAA TTCAAGGAAC CTGGGATTCA AGTCCAACTG 60 

TGACACCAAC TTACACTGTG GNCTCCAATA AACTGCTTCT TTCCTATTCC CTCTCTATTA 12 0 

AATAAAATAA GGAAAACGAT GTCTGTGTAT AGCCAAGTCA GNTATCCTAA AAGGAGATAC 180 

TAAGTGACAT TAAATATCAG AATGTAAAAC CTGGGAACCA GGTTCCCAGC CTGGGATTAA 240 
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ACTGACAGCA AGAAGACTGA ACAGTACTAC TGTGAAAAGC CCGAAGNGGC AATATGTTCA 3 00 

CTCTACCGTT GAJKGGATGGC TGGGAGAATG AATGCTCTGT CCCCCAGTCC CAAGCTCACT 3 60 

TACTATACCT CCTTTATAGC CTAGGAGA 3 88 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS ! single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 13 : 

TAGTAGTTGC CTATAATCAT GTTTCTCATT ATTTTCACAT TTTATT'AACC AATTTCTGTT 6 0 

TACCCTGAAA AATATGAGGG AAATATATGA AACAGGGAGG CAATGTTCAG ATAATTGATC lAO 

ACAAGATATG ATTTCTACAT CAGATGCTCT TTCCTTTCCT GTTTATTTCC TTTTTATTTC 180 

GGTTGTGGGG TCGAATGTAA TAGCTTTGTT TCAAGAGAGA GTTTTGGCAG TTTCTGTAGC 24 0 

TTCTGACACT GCTCATGTCT CCAGGCATCT ATTTGCACTT TAGGAGGTGT CGTGGGAGAC 300 

TGAGAGGTCT ATTTTTTCCA TATTTGGGCA ACTACTA . ' 337 

(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 571 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

TAGTAGTTGC CATACAGTGC CTTTCCATTT ATTTAACCCC CACCTGAACG GCATAAACTG 6 0 

AGTGTTCAGC TGGTGTTTTT TACTGTAAAC AATAAGGAGA CTTTGCTCTT CATTTAAACC 12 0 

AAAATCATAT TTCATATTTT ACGCTCGAGG GTTTTTACCG GTTCCTTTTT ACACTCCTTA 180 

AAACAGTTTT TAAGTCGTTT GGAACAAGAT ATTTTTTCTT TCCTGGCAGC TTTTAACATT 24 0 

ATAGCAAATT TGTGTCTGGG GGACTGCTGG TCACTGTTTC TCACAGTT.GC AAATCAAGGC 3 00 

ATTTGCAACC AAGAAAAAAA AATTTTTTTG TTTTATTTGA AACTGGACCG GATAAACGGT 3 60 

GTTTGGAGCG GCTGCTGTAT ATAGTTTTAA ATGGTTTATT GCACCTCCTT AAGTTGCACT 42 0 

TATGTGGGGG GGGGNTTTTG NATAGAAAGT NTTTANTCAC ANAGTCACAG GGACTTTTNT 4 80 

CTTTTGGNNA CTGAGCTAAA AAGGGCTGNT TTTCGGGTGG GGGCAGATGA AGGCTCACAG 54 0 

GAGGCCTTTC TCTTAGAGGG GGGAACTNCT A 571 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 548 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 15: 
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TATATATTTA 
TAAAAGTATT 
TCCCCCACCC 
AAGTCTTTGG 
TGGCTGTGCA 
CTCCCAGGTT 
CATCACCATG 
GGCCAGGNTG 
GCTAGGATTA 
AACTACTA 



ATAACTTAAA TATATTTTGA 
TCCAAAAAGC ATAAAACCAA 
GCACTGAAAC TTCACCTTCT 
TGCGTGCTCA CTACTCTTTT 
GCCCAGGGGT GGAGTACAAT 
CATGAGATTC TCCTGNTTCA 
CCTGGNTAAT CTTTTTTNGT 
GTNTCGAACT CCTGACCTCA 
CAGACATGAG CCACTGNGCC 



TCACCCACTG GGGTGATAAG ACAATAGATA 
AGTATCATAC CAAACCAAAT TCATACTGCT 
AACTGTCTAC CTAACCAAAT TCTACCCTTC 
TTTTTTTTTT TTTNTTTTGG AGATGGAGTC 
GGCACAACCT CAGCTCACTG NAACCTCCGC 
GCCTTCCCAG TAGCTGGGAC TACAGGTGTG 
TTTNGGGTAG AGATGGGGGT TTTACATGTT 
AGTGATCCAC CCACCTCAGG CTCCCAAAGT 
CAGNCCTGGT GCATGCTCAC TTCTCTAGGC 



60 



120 



180 



240 



300 
360 
420 



480 



540 
548 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 638 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



TTCCGTTATG 


CACATGCAGA 


ATATTCTATC 


GGTACTTCAG 


CTATTACTCA 


TTTTGATGGC 


60 


GCAATCCGAG 


CCTATCCTCA 


AGATGAGTAT 


TTAGAAAGAA 


TTGATTTAGC 


GATAGACCAA 


120 


GCTGGTAAGC 


ACTCTGACTA 


CACG/iAATTG 


TTCAGATGTG 


ATGGATTTAT 


GACAGTTGAT 


IBO 


CTTTGGAAGA 


GATTATTAAG 


TGATTATTTT 


AAAGGGAATC 


CATTAATTCC 


AGAATATCTT 


240 


GGTTTAGCTC 


AAGATGATAT 


AGAAATAGAA 


CAGAAAGAGA 


CTACAAATGA 


AGATGTATCA 


300 


CCAACTGATA 


TTGAAGAGCC 


TATAGTAGAA 


7UVTGAATTAG 


CTGCATTTAT 


TAGCCTTACA 


360 


CATAGCGATT 


TTCCTGATGA 


ATCTTATATT 


CAGCCATCGA 


CATAGCATTA 


CCTGATGGGC 


420 


AACCTTACGA 


ATAATAGAAA 


CTGGGTGCGG 


GGCTATTGAT 


GAATTCATCC 


NCAGTAAATT 


480 


TGGATATNAC 


AAAATATAAC 


TCGATTGCAT 


TTGGATGATG 


GAATACTAAA 


TCTGGCAAAA 


540 


GTAACTTTGG 


AGCTACTAGT 


AACCTCTCTT 


TTTGAGATGC 


AAAATTTTCT 


TTTAGGGTTT 


600 


CTTATTCTCT 


ACTTTACGGA 


TATTGGAGCA 


TAACGGGA 






638 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

ACTGATGGAT GTCGCCGGAG GCGAGGGGCC TTATCTGATG CTCGGCTGCC TGTTCGTGAT 6 0 

GTGCGCGGCG ATTGGGCTGT TTATCTCAAA CACCGCCACG GCGGTGCTGA TGGCGCCTAT 12 0 

TGCCTTAGCG GCGGCGAAGT CAATGGGCGT CTCACCCTAT CCTTTTGCCA TGGTGGTGGC 180 

GATGGCGGCT TCGGCGGCGT TTATGACCCC GGTCTCCTCG CCGGTTAACA CCCTGGTGCT 24 0 

TGGCCCTGGC AAGTACTCAT TTAGCGATTT TGTCAAAATA GGCGTG 2 86 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 base pairs 
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(B) 
(C) 
(D) 



TYPE: nucleic acid 
STRANDEDNESS : single 
TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



TCGGTCATAG CAGCCCCTTC TTCTCAATTT CATCTGTCAC TACCCTGGTG TAGTATCTCA 
TAGCCTTACA TTTTTATAGC CTCCTCCCTG GTCTGTCTTT TGATTTTCCT GCCTGTAATC 
CATATCACAC ATAACTGCAA GTAAACATTT CTAAAGTGTG GTTATGCTCA TGTCACTCCT 
GTGNCAAGAA ATAGTTTCCA TTACCGTCTT AATAAAATTC GGATTTGTTC TTTNCTATTN 
TCACTCTTCA CCTATGACCG AA 



262 



240 



120 



180 



60 



(2) INFORMATION FOR SEO ID NO: 19: 

(i) SEQUENCE CHAR.ACTER ISTICS : 

(A) LENGTH: 261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ' ID NO: 19: 

TCGGTCATAG CAAAGCCAGT GGTTTGAGCT CTCTACTGTG TAAACTCCTA AACCAAGGCC 60 

ATTTATGATA AATGGTGGCA GGATTTTTAT TATAAACATG TACCCATGCA AATTTCCTAT 12 0 

AACTCTGAGA TATATTCTTC TACAJTTAAA, CAATAAAAAT AAJCTATTTT TAAAAGCCTA 18 0 

ATTTGCGTAG TTAGGTAAGA GTGTTTAATG AGAGGGTATA AGGTATAAAT CACCAGTCAA 24 0 

CGTTTCTCTG. CCTATGACCG A .„ . 261 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

TACAACGAGG CGACGTCGGT AAAATCGGAC ATGAAGCCAC CGCTGGTCTT TTCGTCCGAG 60 

CGATAGGCGC CGGCCAGCCA GCGGAACGGT TGCCCGGATG GCGAAGCGAG CCGGAGTTCT 12 0 

TCGGACTGAG TATGAATCTT GTTGTGAAAA TACTCGCCGC CTTCGTTCGA CGACGTCGCG 180 

TCGAJU^TCTT CGANCTCCTT ACGATCGAAG TCTTCGTGGG CGACGATCGC GGTCAGTTCC 24 0 

GCCCCACCGA AATCATGGTT GAGCCGGATG CTGNCCCCGA AGNCCTCGTT TGTN 2 94 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 1 : 



TTGGTAAAGG GCATGGACGC AGACGCCTGA CGTTTGGCTG AAAATCTTTC ATTGATTCGT 60 

ATCAATGAAT AGGAAAATTC CCAAAGAGGG AATGTCCTGT TGCTCGCCAG TTTTTNTGTT 12 0 

GTTCTCATGG ANAAGGCAAN GAGCTCTTCA GACTATTGGN ATTNTCGTTC GGTCTTCTGC 180 

CAACTAGTCG NCTTGCNANG ATCTTCAT 20 8 



(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 287 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi^ SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



NCCNTTGAGC TGAGTGATTG AGATNTGTAA TGGTTGTAAG GGTGATTCAG GCGGATTAGG 6 0 

GTGGCGGGTC ACCCGGCAGT GGGTCTCCCG ACAGGCCAGC AGGATTTGGG GCAGGTACGG . 12 0 

NGTGCGCATC GCTCGACTAT ATGCTATGGC AGGCGAGCCG TGGAAGGNGG ATCAGGTCAC 18 0 

GGCGCTGGAG CTTTCCACGG TCCATGNATT GNGATGGCTG TTCTAGGCGG CTGTTGCCAA 24 0 

GCGTGATGGT ACGCTGGCTG GAGCATTGAT TTCTGGTGCC AAGGTGG 2 87 



(2) INFORMATION FOR SEQ ID NO:23; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 



TTGGGTAAAG GGAGCAAGGA GAAGGCATGG AGAGGCTCAl^J GCTGGTCCTG GCCTACGACT 6 0 

GGGCCAAGCT GTCGCCGGGG ATGGTGGAGA ACTG7VAGCGG GACCTCCTCG AGGTCCTCCG 12 0 

NCGTTACTTC NCCGTCCAGG AGGAGGGTCT TTCCGTGGTC TNGGAGGAGC GGGGGGAGAA 180 

GATNCTCCTC ATGGTCNACA TCCC 2 04 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 264 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

TGGATTGGTC AGGAGCGGGT AGAGTGGCAC CATTGAGGGG A.TATTCAAAA ATATTATTTT 60 

GTCCTAAATG ATAGTTGCTG AGTTTTTCTT TGACCCATGA GTTATATTGG AGTTTATTTT 120 

TTAACTTTCC AATCGCATGG ACATGTTAGA CTTATTTTCT GTTAATGATT NCTATTTTTA 180 
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TTAAATTGGA TTTGAGAAAT TGGTTNTTAT TATATCAATT TTTGGTATTT GTTGAGTTTG 24 0 

ACATTATAGC TTAGTATGTG ACCA 2 64 

(2) INFOPP^ATION FOR SEQ ID NO: 25: 

(i) SEQUEIJCE CHARACTERISTICS: 

(A) LENGTH: 376 base pairs 

(B) TYPE: nucleic acid 
IC) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 25: 



TTACAACGAG GGGAAACTCC GTCTCTACAA AAATTAAAAA ATTAGCCAGG TGTGGTGGTG 6 0 

TGCACCeCCA ATCCCAGCTA CTTGGGAGGT TGAGACACAA GA^^TCACCTA NATGTGGGAG 12 0 

GTCAAGGTTG CATGAGTCAT GATTGTGCCA CTGCACTCCA GCCTGGGTGA CAGACCGAGA 180 

CCCTGCCTCA ANAGANAANG AATAGGAAGT TCAGAAATCN TGGNTGTGGN GCCCAGCAAT 24 0 

CTGCATCTAT NCAACCCCTG CAGGCAANGC TGATGCAGCC TANGTTCAAG AGCTGCTGTT 3 00 

TCTGGAGGCA GCAGTTNGGG CTTCCATCCA GTATCACGGC CACACTCGCA Cl^AGCCATCT 3 60 

GTCCTCCGTN TGTNAC ' " ' • , ; 



(2) INFORr4ATION FOR SEQ ' ID ' NO : 2 ^ : ' 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 base pairs 

(B) TYPE: nucleic acid 
iC) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



TTACAACGAG GGGAAACTCC GTCTCTACAA AAATTAAAAA ATTAGCCAGG TGTGGTGGTG 60 

TGCACCTGTA ATCCCAGCTA CTTGGGCGGC TGAGACACAA GAACCACCTA AATGTGGGAG 12 0 

GGTCAAGGTT GCATGAGTCA TGATCGCGCC ACTGCACTCC AGCCTGGGTG ACAGACTGAG 180 

ACCCTGCCTC AAAAGAAAAA GAATAGGAAG TTCAGAAACC CTGGGTGTGG NGCCCAGCAA 24 0 

TCTGCATTTA AACAATCCCT " GCAGGCAATG CTGATGCAGC CTAAGTTCAA GAGCTGCTGT 300 

TCTGGAGGCA GNAGTAAGGG CTTCCATCCA" GCATCACGGN CAACACTGCA AAAGCACCTG 3 60 

TCCTCGTTGG TA 3 72 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

TTCTGTCCAC ATCTACAAGT TTTATTTATT TTGTGGGTTT TCAGGGTGAC TAAGTTTTTC 60 

CCTACATTGA AAAGAGAAGT TGCTAAAAGG TGCACAGGAA ATCATTTTTT TAAGTGAATA 12 0 

TGATAATATG GGTCCGTGCT TAATACAACT GAGACATATT TGTTCTCTGT TTTTTTAGAG 180 
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TCACCTCTTA AAGTCCAATC CCACAATGGT GAAAAAAAAA TAGAAAGTAT TTGTTCTACC 24 0 

TTTAAGGAGA CTGCAGGGAT TCTCCTTGAA .AACGGAGTAT GGAATCAATC TTAAATAAAT 3 00 

ATGAAATTGG TTGGTCTTCT GGGATAAGAA ATTCCCAACT CAGTGTGCTG AAATTCACCT 360 

GACTTTTTTT GGGAAAAAAT AGTCGAAAAT GTCAATTTGG TCCATAAAAT ACATGTTACT 42 0 

ATTAAAAGAT ATTTAAAGAC AAATTCTTTC AGAGCTCTAA GATTGGTGTG GACAGAA 4 77 



(2) INFORMATION FOR SEQ ID NO: 28: 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 438 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(^i) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 



TCTNCAACCT 
ATTCCAGCAA 
CACAAATGCC 
AATCCGATAA 
GACGACTTCA 
GAGGAGATAC 
ATTAGCTCCC 
TTGATGTCAG 



CTTGANTGTC 

AATCCCTCTA 
AAATTAAGAG 
CCCTCCTGGA 
ATCGNCTTAG 
CGGTGGAAAT 
AATTGGGTCT 
GTTGAAGA 



AAAAACCTTN 

GTTTTTGGAG 
CATGGCTATT 
GGTGCTCTAA 
ACAAGTTTAT 
CGTCAAAAGT 
CTAATCACTA 



TAGGCTATCT 

TTTCCTTTTA 
TTCGGGGGCT 
AAACACTCCT 
AGGTTTCTGG 
TCTCCCTCCA 
TTCCTCTAGC 



CTAAAAGCTG 
CTATCTGGGG 
GACAGGTCAA 
GGTGACTCAI- 
GCAGCTCCCT 
CTTGAGAAAT 
TTCCTCCTCC 



ACTGGTATTC 
CTGCCTGAGC 
AAGGGGTGTA 
CATGCCCCTG 
GAATACCCAC 
TTGGGTCCCA 
GGNCTATTGG 



60 
12 0 
180 
240 
300 
360 
420 
438 



(2) INFORMATION FOR SEQ ID NO: 29. 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



AAGAGGGTAC CAGCCCCAAG CCTTGACAAC TTCCATAGGG TGTCAAGCCT GTGGGTGCAC 60 

AGAAGTCAAA AATTGAGTTT TGGGATCCTC AGCCTAGATT TCAGAGGATA TAAAGAAACA 12 0 

CCTAACACCT AGATATTCAG ACAAAAGTTT ACTACAGGGA TGAAGCTTTC ACGGAAAACC 180 

TCTACTAGGA AAGTACAGAA GAGAAATGTG GGTTTGGAGC CCCCAAACAG AATCCCCTCT 24 0 

AGAACACTGC CTAATGAAAC TGTGAGAAGA TGGCCACTGT CATCCAGACA CCAGAATGAT 3 00 

AGACCCACCA AAAACTTATG CCATATTGCC TATAAAACCT ACAGACACTC AATGCCAGCC 36 0 

CCATGAAAAA AAAACTGAGA AGAAGACTGT NCCCTACAAT GCCACCGGAG CAGAACTGCC 42 0 

CCAGGCCATG GAAGCACAGC TCTTATATCA ATGTGACCTG GATGTTGAGA CATGGAATCC 480 

NANGAAATCN TTTTAANACT TCCACGGTTN AATGACTGCC CTATTANATT CNGAACTTAN 54 0 

ATCCNGGCCT GTGACCTCTT TGCTTTGGCC ATTCCCCCTT TTTGGAATGG CTNTTTTTTT 600 

CCCATGCCTG TNCCCTCTTA 62 0 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRTPTTON: SEQ ID NO : 3 0 : 
TTACAACGAG GGGGTCAATG TCATAAATGT CACAATAAAA CAATCTCTTC TTTTTTTTTT 6 0 

(2) INFORMATION FOR SEQ ID NO : 3 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 1 : 



TAGTCTATGC GCCGGACAGA GCAGAATTAA ATTGGAAGTT GCCCTCCGGA' CTTTCTACCC . 6 0 

ACACTCTTCC TGAAAAGAGA AAGAAAAGAG GCAGGAAAGA GGTTAGGATT TCATTTTCAA 12 0 

GAGTCAGCTA ATTAGGAGAG CAGAGTTTAG ^AQAGCAGTAG GCACCCCATG ATACAAACCA 180 

TGGAGAAAGT CCCTGTTTAG TAACTGGCAG ACATGATCCT GCTCAGGTTT TGAAATCTCT 2 40 

CTGCCCATAA AAGATGGAGA GCAGGAGTGC CATCCACATG AACACGTGTC CAAGAAAGAG 3 00 

TCTCAGGGAG ACAAGGGTAT CAAAAAACAA GATTCTTAAT GGGAAGGAAA -TCAAAGGAAA 3 60 

AAATTAGATT TTTCTCTACA TATATATAAT ATACAGATAT TTAACACATT ATTCCAGAGG 4 20 

TGGCTCCAGT CCTTGGGGCT TGAGAGATGG TGAAAACTTT TGTTCCACAT TAACTTCTGC 4 80 

TCTCAAATTC TGAAGTATAT CAGAATGGGA CAGGCAATGT TTTGCTCCAC ACTGGGGCAC 54 0 

AGACCCAAAT GGTTCTGTGC CCGAAGAAGA GAAGCCCGAA AGACATGAAG GATGCTTAAG 6 00 

GGGGGTTGGG AAAGCCAAAT TGGTANTATC TTTTCCTCCT GCCTGTGTTC CNGAAGTCTC 66 0 

CNCTGAAGGA ATTCTTAAAA CCCTTTGTGA GGAAATGCCC CCTTACCATG ACAANTGGTC 72 0 

CCATTGCTTT TAGGGNGATG GAAACACCAA GGGTTTTGAT CC ^ 7 62 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: -li-near 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 



TAGTCTATGC GTGTATTAAC CTCCCCTCCC TCAGTAACAA CCAAAGAGGC AGGAGCTGTT 60 

ATTACCAACC CCATTTTACA GATGCATCAA TAATGACAGA GAAGTGAAGT GACTTGCGCA 12 0 

CACAACCAGT AAATTGGCAG AGTCAGATTT GAATCCATGG. AGTCTGGTCT GCACTTTCAA 180 

TCACCGAATA CCCTTTCTAA GAAACGTGTG CTGAATGAGT GCATGGATAA ATCAGTGTCT 24 0 

ACTCAACATC TTTGCCTAGA TATCCCGCAT AGACTA 2 76 



(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 base pairs 

(B) TYPE: nucleic acid 
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(C) STRA>JDEDrJESS : single 

(D) TOPOLOGY, linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 3 : 

TAGTAGTTGC CAAATATTTG AAAATTTACC CAGAAGTGAT TGAAAACTTT TTGGAAACAA 6 0 

AAACAAATAA AGCCAAAAGG TAAAATAAAA ATATCTTTGC ACTCTCGTTA TTACCTATCC 12 0 

ATAACTTTTT CACCGTAAGC TCTCCTGCTT GTTA.GTGTAG TGTGGTTATA TTA^U^CTTTT 180 

TAGTTATTAT TTTTTATTCA CTTTTCCACT AGAAAGTCAT TATTGATTTA GCACACATGT 24 0 

TGATCTCATT TCATTTTTTC TTTTTATAGG CAAAATTTGA TGCTATGCAA CAAAAATACT 3 00 

CAAGCCCATT ATCTTTTTTC CCCCCGAAAT CTGAAAATTG CAGGGGACAG AGGGAAGTTA 3 60 

TCCCATTAAA AAATTGTAAA TATGTTCAGT TTATGTTTAA AAATGCACAA AACATAAGAA 42 0 

AATTGTGTTT ACTTGAGCTG CTGATTGTAA GCAGTTTTAT CTCAGGGGCA ACTACTA 4 77 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 631 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 4 : 



TAGTAGTTGC 


CAATTCAGAT 


GATCAGAAAT 


GCTGCTTTCC 


TCAGCATTGT 


CTTGTTAAAC 


60 


CGCATGCCAT 


TTGGAACTTT 


GGCAGTGAGA 


AGCCAAAAGG 


AAGAGGTGAA 


TGACATATAT 


120 


ATATATATAT 


ATTCAATGAA 


AGTAAAATGT 


ATATGCTCAT 


ATACTTTCTA 


GTTATCAGAA 


180 


TGAGTTAAGC 


TTTATGCCAT 


TGGGCTGCTG 


CATATTTTAA 


TCAGAAGATA 


AAAGAAAATC 


240 


TGGGCATTTT 


TAGAATGTGA 


TACATGTTTT 


TTTAAAACTG 


TTAAATATTA 


TTTCGATATT 


300 


TGTCTAAGAA 


CCGGAATGTT 


CTTAAAATTT 


ACTAAAACAG 


TATTGTTTGA 


GGAAGAGAAA 


360 


ACTGTACTGT 


TTGCCATTAT 


TACAGTCGTA 


CPJ^GTGCATG 


TCAAGTCACC 


CACTCTCTCA 


420 


GGCATCAGTA 


TCCACCTCAT 


AGCTTTACAC 


ATTTTGACGG 


GGAATATTGC 


AGCATCCTCA 


480 


GGCCTGACAT 


CTGGGAAAGG 


CTCAGATCCA 


CCTACTGCTC 


CTTGCTCGTT 


GATTTGTTTT 


540 


AAAATATTGT 


GCCTGGTGTC 


ACTTTTAAGC 


CACAGCCCTG 


CCTAAAAGCC 


AGCAGAGAAC 


600 


AGAACCCGCA 


CCATTCTATA 


GGCAACTACT 


A 






631 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 578 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

TAGTAGTTGC CATCCCATAT TACAGAAGGC TCTGTATACA TGACTTATTT GGAAGTGATC 6 0 

TGTTTTCTCT CCAAACCCAT TTATCGTAAT TTCACCAGTC TTGGATCAAT CTTGGTTTCC 12 0 

ACTGATACCA TGAAACCTAC TTGGAGCAGA CATTGCACAG TTTTCTGTGG TAAAAACTAA 18 0 

AGGTTTATTT GCTAAGCTGT CATCTTATGC TTAGTATTTT TTTTTTACAG TGGGG AATTG 24 0 

CTGAGATTAC ATTTTGTTAT TCATTAGATA CTTTGGGATA ACTTGACACT GTCTTCTTTT 3 00 

TTTCGCTTTT AATTGCTATC ATCATGCTTT TGAAACAAGA ACACATTAGT CCTCAAGTAT 360 
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TACATAAGCT TGCTTGTTAC GCCTGGTGGT TTAAAGGACT ATCTTTGGCC TCAGGTTCAC 
AAGAATGGGC AAAGTGTTTC CTTATGTTCT GTAGTTCTCA ATAAAAGATT GCCAGGGGCC 
GGGTACTGTG GCTCGCACTG TAATCCCAGC ACTTTGGGAA GCTGAGGCTG GCGGATCATG 
TTAGGGCAGG TGTTCGAAAC CAGCCTGGGC AACTACTA 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 583 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: linear 



_(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

TAGTAGTTGC CTGTAATCCC AGCAACTCAG GAGGCTGGGG CAGGAGAATC AGTTGAACCT 6 0 

GGGAGGCAGA AGTTGTAATT AGCAAAGATC - GCACCATTGC ACTTCAGCCT GGGCAACAAG l'>0 

AGTGAGATTC CATCTCAAAA ACAAAAAAAA GAAAAAGAAA AGAAAAGGAA AAAACGTATA 180 

AACCCAGCCA AAACAAAATG ATCATTCTTT TAATAAGCAA GACTAATTTA ATGTGTTTAT 24 0 

TTAATCAAAG CAGTTGAATC TTCTGAGTTA TTGGTGAAAA TACCCATGTA GTTAATTTAG 300 

GGTTCTTACT TGGGTGAACG TTTGATGTTC ACAGGTTATA AAATGGTTAA CAAGGAAAAT 36 0 

GATGCATAAA GAATCTTATA AACTACTAAA AATAAATAAA ATATAAATGG ATAGGTGCTA 42 0 

TGGATGGAGT TTTTGTGTAA TTTAAAATCT TGAAGTCATT TTGGATGCTC ATTGGTTGTC 480 

TGGTAATTTC CATTAGGAAA AGGTTATGAT ATGGGGAAAC TGTTTCTGGA AJ^TTGCGGAA "^4 0 

TGTTTCTCAT CTGTAAAATG CTAGTATCTC AGGGCAACTA CTA 58 3 

(2) INFORMATION' FOR SEQ ID NO : 3 7 : 

(i) SEQUENCE CHARACTERISTICS: - ' 

' (A) LENGTH: 716 base pairs 
. . (3) TYPE: nucleic acid 

(C) STRANDEDNESS: single - 
■' . (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

GATCTACTAG TCATNTGGAT TCTATCCATG GCAGCTAAGC CTTTCTGAAT GGATTCTACT 6 0 

GCTTTCTTGT TCTTTAATCC AGACCCTTAT ATATGTTTAT GTTCACAGGC AGGGCAATGT 12 0 

TTAGTGAAAA CAATTCTAAA TTTTTTATTT TGCATTTTCA TGCTAATTTC CGTCACACTC 18 0 

CAGCAGGCTT CCTGGGAGAA TAAGGAGAAA TACAGCTAAA GACATTGTCC CTGCTTACTT 24 0 

ACAGCCTAAT GGTATGCAAA ACCACTTCAA TAAAGTAACA GGAAAAGTAC TAACCAGGTA 3 00 

GAATGGACCA AAACTGATAT AGAAAAATCA GAGGAAGAGA GGAACAAATA TTTACTGAGT 3 60 

CCTAGAATGT ACAAGGCTTT TTAATTACAT ATTTTATGTA AGGCCTGCAA AAAACAGGTG 4 20 

AGTAATCAAC ATTTGTCCCA TTTTACATAT AAGGAAACTG AAGCTTAAAT TGAATAATTT 4 80 

AATGCATAGA TTTTATAGTT AGACCATGTT CAGGTCCCTA TGTTATACTT ACTAGCTGTA 54 0 

TGAATATGAG AAAATAATTT TGTTATTTTC TTGGCATCAG TATTTTCATC TGCAAAA^AA 6 00 

AGCTAAAGTT ATTTAGCAAA CAGTCAGCAT AGTGCCTGAT ACATAGTAGG TGCTCCAAAC 660 

ATGATTACNC TANTATTNGG^ TATTANAAAA ATCCAATATA GGCNTGGATA AAACCG 716 



(2) INFORMATION FOR SEQ ID N0^38:^ 

(i) SEQUENCE: CHARACTERISTICS: 

(A) LENGTH: 6 88 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 8 : 



TTCTGTCCAC 


ATATCATCCC 


ACTTTAATTG 


TTAATCAGCA 


AAACTTTCAA 


TGAAAAATCA 


60 


TCCATTTTAA 


CCAGGATCAC 


ACCAGGAAAC 


TGAAGGTGTA 


TTTTTTTTTA 


CCTTAAAAAA 


120 


AAAAAAAAAA 


ACCAAACAAA 


CCAAAACAGA 


TTAACAGCAA 


AGAGTTCTAA 


AAAATTTACA 


180 


TTTCTCTTAC 


AACTGTCATT 


CAGAGAACAA 


TAGTTCTTAA 


GTCTGTTAAA 


TCTTGGCATT 


240 


AACAGAGAAA 


CTTGATGAAN 


AGTTGTACTT 


GGAATATTGT 


GGATTTTTTT 


TTTTGTCTAA 


300 


TCTCCCCCTA 


TTGTTTTGCC 


AACAGTAATT 


TAAGTTTGTG 


TGGAACATCC 


CCGTAGTTGA 


360 


AGTGTAAACA 


ATGTATAGGA 


AGGAATATAT 


GATAAGATGA 


TGCATCACAT 


ATGCATTACA 


420 


TGTAGGGACC 


TTCACAACTT 


CATGCACTCA 


GAAAACATGC 


TTGAAGAGGA 


GGAGAGGACG 


480 


GCCCAGGGTC 


ACCATCCAGG 


TGCCTTGAGG 


ACAGAGAATG 


CAGAAGTGGC 


ACTGTTGAAA 


540 


TTTAGAAGAC 


CATGTGTGAA 


TGGTTTCAGG 


CCTGGGATGT 


TTGCCACCAA 


GAAGTGCCTC 


600 


CGAGAAATTT 


CTTTCCCATT 


TGGAATACAG 


GGTGGCTTGA 


TGGGTACGGT 


GGGTGACCCA 


660 


ACGAAGAAAA 


TGAAATTCTG 


CCCTTTCC 








688 



(2) INFORMATION FOR SEQ ID NO : 3 9 : 

(i) SEQUENCE CHAJIACTERISTICS : 

(A) LENGTH: 585 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 39: 



TAGTAGTTGC 


CGCNNACCTA 


AAANTTGGAA 


AGCATGATGT 


CTAGGAAACA 


TANTAAAATA 


60 


GGGTATGCCT 


ATGTGCTACA 


GAGAGATGTT 


AGCATTTAAA 


GTGCATANTT 


TTATGTATTT 


120 


TGACAAATGC 


ATATNCCTCT 


ATAATCCACA 


ACTGATTACG 


AAGCTATTAC 


AATTAAAAAG 


180 


TTTGGCCGGG 


CGTGGTGGGC 


GGTGGCTGAC 


GCCTGTAATC 


CCAGCACTTT 


GGGAGGCCGA 


240 


GGCACGCGGA 


TCACGAGGTC 


GGGAGTTCAA 


GACCATCCTG 


GCTAACACGG 


TGAAAGTCCA 


300 


TCTCTACTAA 


AAATACGAAA 


AAATTACCCC 


GGCGTGGTGG 


CGGGCGCCTG 


TAGTCCCAGC 


360 


TACTCCGGAG 


GCTGAGGCAG 


GAGAATGGCG 


TGAACCCAGG 


ACACGGAGCT 


TGCAGTGTGC 


420 


CAACATCACG 


TCACTGCCCT 


CCAGCCTGGG 


GGACAGGAAC 


AAGANTCCCG 


TCCTCANAAA 


480 


AGAAAAATAC 


TACTNATANT 


TTCNACTTTA 


TTTTAANTTA 


CACAGAACTN 


CCTCTTGGTA 


540 


CCCCCTTACC 


ATTCATCTCA 


CCCACCTCCT 


ATAGGGCACN 


NCTAA 




585 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 75 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

TCTGTCCACA CCAATCTTAG AAGCTCTGAA AAGAATTTGT CTTTAAATAT CTTTTAATAG 6 0 

TAACATGTAT TTTATGGACC AAATTGACAT TTTCGACTGT TTTTTCCAAA AAAGTCAGGT 12 0 
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GAATTTCAGC ACACTGAGTT GGGAATTTCT TATCCCAGAA GACCAACCAA TTTCATATTT 
ATTTAAGATT GATTCCATAC TCCGTTTTCA AGGAGAATCC CTGCAGTCTC CTTAAAGGTA 
GAACAAATAC TTCCTATTTT TTTTTCACCA TTGTGGGATT GGACTTTAAG AGGTGACTCT 
AAAAAAACAG AGAACAAATA TGTCTCAGTT GTATTAAGCA CGGACCCATA TTATCATATT 
CACTTAAAAA AATGATTTCC TGTGCACCTT TTGGCAACTT CTCTTTTCAA TGTAGGGAAA 
AACTTAGTCA CCCTGAAAAC CCACAAAATA AATAAAACTT GTAGATGTGG ACAGA 

(2) INFORMATION FOR SEQ ID NO : 4 1 : 



(i) SEQUEWCE CHARACTERISTICS: 

(A) LENGTH: 423 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ■ ID NO : 4 1- : 

TAAGAGGGTA CATCGGGTAA GAACGTAGGC ACATCTAGAG CTTAGAGAAG TCTGGGGTAG 6 0 

GAAAAAAATC TAAGTATTTA TAAGGGTATA GGTAACATTT AAAAGTAGGG CTAGCTGACA 12 0 

TTATTTAGAA AGAACACATA CGGAGAGATA AGGGCAAAGG ACTAAGACGA QAGGAACACT 18 0 

AATATTTAGT GATCACTTCC ATTCTTGGTA AAAATAGTAA CTTTTAAGTT AGCTTCAAGG 24 0 

AAGATTTTTG GCCATGATTA GTTGTCAAAA GTTAGTTCTC TTGGGTTTAT ATTACTAATT 300 

TTGTTTTAAG ATCCTTGTTA GTGCTTTAAT AAAGTCATGT TATATGAAAC GCTCTAAAAC 360 

ATTGTAGCAT GTTAAATGTC ACAATATACT TACCATTTGT TGTATATGGC TGTACCCTCT 4-^0 

^'^^ ■ . 423 
(2) INFORMATION FOR SEQ ID NO: 42: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 527 base pairs 

(B) TYPE: nucleic acid 

. (C) STRANDEDNESS: . single 
(D.) TOPOLOGY:, linear 



(xi) SEQUENCE D.ESCRIPTION: SEQ ID NO: 42: 



TCTCCTAGGC TAATGTGTGT GTTTCTGTAA AAGTAAAAAG TTAAAAATTT TAAAAATAGA 6 0 

AAAAAGCTTA TAGAATAAGA ATATGAAGAA AGAAAATATT TTTGTACATT TGCACAATGA 12 0 

GTTTATGTTT TAAGCTAAGT GTTATTACAA AAGAGCCAAA AAGGTTTTAA AAATTAAAAC 180 

GTTTGTAAAG TTACAGTACC CTTATGTTAA TTTATAATTG AAGAAJ^GAAA AACTTTTTTT 24 0 

TATAAATGTA GTGTAGCCTA AGCATACAGT ATTTATAAAG TCTGGCAGTG TTCAATAATG 3 00 

TCCTAGGCCT TCACATTCAC TCACTGACTC ACCCAGAGCA ACTTCCAGTC CTGTAAGCTC 3 60 

CATTCGTGGT AAGTGCCCTA TACAGGTGCA CCATTTATTT TACAGTATTT TTACTGTACC 42 0 

TTCTCTATGT TTCCATATGT TTCGATATAC AAATACCACT GGTTACTATN GCCCNACAGG 4 80 

TAATTCCAGT AACACGGCCT GTATACGTCT G^STANCCCTA GNGAAGA 527 

(2) INFORMATION FOR SEQ ID NO: 43: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 331 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEC ID NO : 4 3 : 



TCTTCAACCT CGTAGGACAA CTCTCATATG CCTGGGCACT ATTTTTAGGT TACTACCTTG 60 

GCTGCCCTTC TTTAAGAAAA AAAAAAGAAG AAAAAAGAAC TTTTCCACAA GTTTCTCTTC 12 0 

CTCTAGTTGG AAAATTAGAG AAATCATGTT TTTAATTTTG TGTTATTTCA GATCACAAAT 180 

TCAAACACTT GTAAACATTA AGCTTCTGTT CAATCCCCTG GGAAGAGGAT TCATTCTGAT 24 0 

ATTTACGGTT CAAAAGAAGT TGTAATATTG TGCTTGGAAC ACAGAGAACC AGTTATTAAC 3 00 

TTCCTACTAC TATTATATAA TAAATAATAA C 331 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 592 base pairs 

(B) TYPE: nucleic acid 

(C) STRJ^J^DEDNESS : single 

(D) TOPOLOGY: linear 



ixx) SEQUENCE DESCRIPTION: SEQ ID NO:44: 



GGCTTAGTAG TTCCCAGGCA AAATARCGTT GATTCTCCTC AGGAGCCACC CCC;iACACCC 6 0 

CTGTTTGCTT CTAGACCTAT ACCTAGACTA AAGTCCCAGC AGACCCCTAG AGGTGAGGTT 12 0 

CAGAGTGACC CTTGACGAGA TGTGCTACAC TAGAAAAGAA CTGCTTGAGT TTTCTAATTT 180 

ATATAAGCAG AAATCTGGAG AAGAGTCATA GGAATGGATA TTA.ZVGGGTGT GAGATAATGG 24 0 

CGGAAGGAAT ATAGAGTTGG ATCAGGCTGG ACTTATTGAT TTGAACCCAC TAAGTAGAGA 3 00 

TTCTGCTTTT GATGTTGCAG CTCAGGGAGT TAAAAAAGGT TTTAATGGTT CTAATAGTTT 3 60 

ATTTGCTTGG TTAGCTGAAA TATGGATAAA AGATGGCCCA CTGTGAGCAA GCTGGAAATG 42 0 

CCTGATCTCT CTCAGTTTAA TGTAGAGGAA GGGATCCAAA AGTTTAGGGA GANTTGGATG 4 80 

CTGGRAKTGG ATTGGTCACT TTGRGACCTA CCCVITCCCAG CTGGGAGGGT CCAGAAGATA 54 0 

CACCCTTGAC CAACGCTTTG CGAAATGGAT TTGTGATGGC GGCAACTACT AA 5 92 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHAPJVCTERISTICS : 

(A) LENGTH: 567 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



GGCTTAGTAG TTGCCATTGC GAGTGCTTGC TCAACGAGCG TTGAACATGG CGGATTGTCT 6 0 

AGATTCAACG GATTTGAGTT TTACCAGCAA AGCGAACCAA GCGCGGCCCA GAGAATTATG 12 0 

GGTTGGTTGG CTTTGAAAAG ATGGAAATCC TGTAGGCCTA GTCAGAAAAG CCTTCTTGCA 18 0 

GAACAGTTGG TTCTCGGGCG AACGCTCATC AAGATGCCCA TTGGAAAGGC TAGCGTGTAT 24 0 

TTGGGAGAGC CTGATAGCGT GTCTTCTGAT GATGTTTGTG CTTGGACAGT GACAAAAGAT 3 00 

ATGCAAAGCA AGTCCGAACT AGACGTCAAG CTTCGTGAGC AAATTATTGT AGACTCCTAC 3 60 

TTATACTGTG AGGAATGATA GCCAAGGGTG GGGACTTTAA GACTAAGGTG GTTTGTACTT 420 

GCGCCGATGA TCCCAGGCAG AAAGAMCTGA TCGCTAGn^TT TATACGGGCA ACTACTAAGC 4 80 

CGAATTCCAG CACACTGGCG GCCGTTACTA ATTOGATCCG ANCTCGGTAC CAGCTTGATG 54 0 

CATASCTTGA GTTWTCTATA NTGTCNC 567 
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(2) INFORMATION FOR SEQ ID NO : 4 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 908 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DE3CRI FTICi^ : SEQ ID NO: 46: 

GAGCGAAAGA CCGAGGGCAG NGNNTANGNG CGANGAAGCG GAGAGGGCCA AAAAGCAACC 6 0 

GCTTTCCCCG GGGGGTGCCG ATTCATTAAG GCAGGTGGAG GACAGGTTTC CCGATGGAAG 12 0 

GCGGCAGGGG CGCAAGCAAT TAATGTGAGT AGGCCATTCA TTAGCACCCG GGCTTAACAT 18 0 

TTAAGCTTCG GGTTGGTATG TGGTGGGAAT TGTGAGCGGA TAACAATTTC ACACAGGAAA 24 0 

CAGCTATGAC CATGATTACG CCAAGCTATT TAGGTGACAT TATAGAATAA CTCAAGTTAT 3 00 

GCATCAAGCT TGGTACCGAG TTCGGATCCA CTAGTAACGG CCGCCAGTGT GTGGAATTCG 3 60 

GCTTAGTAGT TGCCGACCAT GGAGTGCTAC CTAGGGTAGA ATACCTGAGY TCCTCCCTAG 4 20 

CCTCACTCAC ATTAAATTGT ATCTTTTCTA CATTAGATGT CCTCAGCGCC TTATTTCTGC 4 80 

TGGACWATCG ATAAATTAAT CCTGATAGGA TGATAGCAGC AGATTAATTA CTGAGAGTAT 54 0 

GTTAATGTGT CATCCCTCCT ATATAACGTA TTTGCATTTT AATGGAGCAA TTCTGGAGAT 6 00 

AATCCCTGAA GGCAAAGGAA TGAATCTTGA GGGTGAGAAA GCCAGAATCA GTGTCCAGCT 6 60 

GCAGTTGTGG GAGAAGGTGA TATTATGTAT GTCTCAGAAG TGACAGCATA TGGGCAACTA 72 0 

CTAAGCCCGA ATTCCAGCAC ACTGGCGGGC GTTACTAATG GATCCGAGCT CGGTACCAAG 7 80 

CTTGATGCAT AGCTTGAGTA TCTATAGTGT CACTAAATAG CGTGGCGTTA TCATGGTCAT 84 0 

AGCTGTTTCC TGTGTGAAAT TGTTATCCGC TCCCAATTCC CCCCACCATA CGAGCCGGAA 90 0 

CATAAAGT • . ' ' ' • 908 

(2) INFORMATION FOR SEQ ID N0:47:- 

(i) SEQUENCE CHARACTERISTICS': 

(A) LENGTH: 480 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

TGCCAACAAG GAAAGTTTTA AATTTCCCCT TGAGGATTCT TGGTGATCAT CAAATTCAGT 6 0 

GGTTTTTAAG GTTGTTTTCT GTCAAATAAC TCTAACTTTA AGCCAAACAG TATATGGAAG 12 0 

CACAGATAKA ATATTACACA GATAAAAGAG GAGTTGATCT AAAGTARAGA TAGTTGGGGG 180 

CTTTAATTTC TGGAACCTAG GTCTCCCCAT CTTCTTCTGT GCTGAGGAAC TTCTTGGAAG 2 40 

CGGGGATTCT AAAGTTCTTT GGAAGACAGT TTGAAAACCA CCATGTTGTT CTCAGTACCT 3 00 

TTATTTTTAA AAAGTAGGTG AACATTTTGA GAGAGAAAAG GGCTTGGTTG AGATGAAGTC 3 60 

-CCCCCCCCCC CTTTTTTTTT TTTTAGCTGA AATAGATACC CTATGTTNAA RGAARGGATT 42 0 

ATTATTTACC ATGCCAYTAR SCACATGCTC TTTGATGGGC NYCTCCSTAC CCTCCTTAAG 4 80 

(2) INFORMATION FOR SEQ ID: NO: 48: 

\ Ci) SEQUENCE CHARACTERISTICS': ' ' 

(A) LENGTH: 591 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 8 : 



AAGAGGGTAC CGAGTGGAAT TTCCGCTTCA CTAGTCTGGT GTGGCTAGTC GGTTTCGTGG 60 

TGGCCAACAT TACGAACTTC CAACTCAACC GTTCTTGGAC GTTCAAGCGG GAGTACCGGC 12 0 

GAGGATGGTG GCGTGAATTC TGGCCTTTCT TTGCCGTGGG ATCGGTAGCC GCCATCATCG 180 

GTATGTTTAT CAAGATCTTC TTTACTAACC CGACCTCTCC GATTTACCTG CCCGAGCCGT 24 0 

GGTTTAACGA GGGGAGGGGG ATCCAGTCAC GCGAGTACTG GTCCCAGATC TTCGCCATCG 3 00 

TCGTGACAAT GCCTATCAAC TTCGTCGTCA ATAAGTTGTG GACCTTCCGA ACGGTGAAGC 36 0 

ACTCCGAAAA CGTCCGGTGG CTGCTGTGCG GTGACTCCCA AAATCTTGAT AACAACAAGG 4 20 

TAACCGAATC GCGCTAJvGGA ACCCCGGCAT CTCGGGTACT CTGCATATGC GTACCCCTTA 4 80 

AGCCGAATTC CAGCACACTG GCGGCCGTTA CTAATTGGAT CCGAACTCCG TAACCAAGCC 54 0 

TGATGCGTA-A CTTGAGTTAT TCTATAGTGT CCCTAAAATA ACCTGGCGTT A 5 91 

(2) INFORMATION FOR SEQ ID NO r 4 9 : 



{ i ) SEQUENCE CIIARACTERISTIC3 : 

(A) LENGTH: 454 base pairs 

(B) TYPE: nucleic acid 

(C) STRAJnIDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 9 : 



AAGAGGGTAC CTGCCTTGAA ATTTAAATGT CTAAGGAAAR TGGGAGATGA TTAAGAGTTG 6 0 

GTGTGGCYTA GTCACACCAA AATGTATTTA TTACATCCTG CTCCTTTCTA GTTGACAGGA 12 0 

AAGAAAGCTG CTGTGGGGAA AGGAGGGATA AATACTGAAG GGATTTACTA AACAAATGTC 180 

CATCACAGAG TTTTCCTTTT TTTTTTTTTG AGACAGAGTC TTGCTCTGTC ACCCAGGCTG 24 0 

GAATGAAGWG GTATGATCTC AGTTGAATGC AACCTCTACC TCCTAGGTTC AAGCGATTCT 3 00 

CATGCCTCAG CCTCCTGAGC AGCTGGGACT ATAGGCGCAT GCTACCATGC CAGGCTAATT 3 60 

TTTATATTTT TATTAGAGAC GGGGTGTTGC CATGTTGGCC AGGCAGGTCT CGAACTCCTG 42 0 

GGCCTCAGAT GATCTGCCCC ACCGTACCCT CTTA 454 

(2) INFORMATION FOR SEQ ID NO: 50: . 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 463 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

AAGAGGGTAC CAAAAAAAAG AAAAAGGAAA AAAAGAAAAA CAACTTGTAT AAGGCTTTCT 60 

GCTGCATACA GCTTTTTTTT TTTAJ^TAAA TGGTGCCAAC AAATGTTTTT GCATTCACAC 12 0 

CAATTGCTGG TTTTGAAATC GTACTCTTCA AAGGTATTTG TGCAGATCAA TCCAATAGTG 18 0 

ATGCCCCGTA GGTTTTGTGG ACTGCCCACG TTGTCTACCT TCTCATGTAG GAGCCATTGA 24 0 

GAGACTGTTT GGACATGCCT GTGTTCATGT AGCCGTGATG TCCGGGGGCC CTGTACATCA 3 00 

TGTTACCGTG GGQTGGGCTC TGCATTGGCT GCTGGGCATA TGGCTGGGTG CCCATCATGC 3 60 

CCATCTGCAT CTGCATAGGG TATTGGGGCG TTTGATCCAT ATAGCCATGA TTGCTGTGGT 420 

AGCCACTGTT CATCATTGGC TGGGACATGC TGTTACCCTC TTA 4 63 
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(2) INFORMATION FOR SEQ ID NO:51: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 399 base pairs 

(B) TYPE: nucleic acid- 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

CTTCAACCTC CCAAAGTGCT GGGATTACAG GACTGAGCCA CCACGCTCAG CCTAAGCCTC 60 

TTTTTCACTA CCCTCTAAGC GATCTACCAC AGTGATGAGG GGCTAAAGAG CAGTGCAATT 12 0 

TGATTACAAT AATGGAACTT AGATTTATTA ATTAACAATT TTTCCTTAGC ATGTTGGTTC 180 

CATAATTATT AAGAGTATGG ACTTACTTAG AT^TGAGCTT TCATTTTA^IG AATTTCATCT 24 0 

TTGACCTTCT CTATTAGTCT GAGCAGTATG ACACTATACG TATTTTATTT AACTAACCTA 3 00 

CCTTGAGCTA TTACTTTTTA AAAGGCTATA TACATGAATG TGTATTGTCA ACTGTAAAGC 3 60 

CCCACAGTAT TTAATTATAT CATGATGTCT TTGAGGTTG ' 399 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 92 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: -single 

(D) TOPOLOGY: linear 



(xi): SEQUENCE DESCRIPTION : SEQ ID NO: 52: 

CTTCAACCTC AATCAACCTT GGTAATTGAT AAAATCATCA CTTAACTTTC TGATATAATG 60 

GCAATAATTA TCTGAGAAAA AAAAGTGGTG AAAGATTAAA CTTGCATTTC TCTCAGAATC 12 0 

TTGAAGGATA ^TTGAATAAT TCAAAAGCGG AATCAGTAGT ATCAGCCGAA GAAACTCACT 180 

TAGCTAGAAC GTTGGACCCA TGGATCTAAG TCCCTGCCCT TCCACTAACC ' AGCTGATTGG 24 0 

TTTTGTGTAA ACCTCCTACA CGCTTGGGCT TGGTCGCCTC ATTTGTCAAA GTAAAGGCTG 3 00 

AAATAGGAAG ATAATGAACC GTGTCTTTTT GGTCTCTTTT CCATCCATTA CTCTGATTTT 36 0 

ACAAAGAGGC CTGTATTCCC CTGGTGAGGT TG 3 92 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 179 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) .SEQUENCE -DESCRIPTION: SEQ ID NO: 53: 

TTCGGGTGAT GCCTCCTCAG GCTACAGTGA AQACTGGATT ACAGAAAGGT GCCAGCGAGA 
TTTGAGATTC CTGTAAACCT CTAAAGAAAA CX3AGTCGCGC CTCAACTGAT GTAGAAATGA 
CTAGTTCAGC ATACNGAGAC ACNTCTGACT CCGATTCTAG AGGACTGAGT GACCTGCAN 

(2) INFORMATION FOR SEQ ID NO':54:" 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

TTCGGGTGAT GCCTCCTCAG GCTACATCAT NATAGAAGCA AAGTAGAANA ATCNNGTTTG 
TGCATTTTCC CACANACAAA ATTCAAATGA NTGGAAGAAA TTGGGANAGT AT 

(2) INFORMATION FOR SEQ ID NO: 55: 

. (i) SEQUENCE CHAPJVCTERISTICS : 

(A) LENGTH: 22^ base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

TGAGCTTCCG CTTCTGACAA CTCAATAGAT AATCAA.-\GGA CAACTTTAAC AGGGATTCAC 6 0 

AAAGGAGTAT ATCCAAATGC CAATAAACAT ATAAAAAGGA ATTCAGCTTC ATCATCATCA 12 0 

GAAGWATGCA AATTAAAACC ATAATGAGAA ACCACTATGT CCCACTAGAA TAGATAAAAT 180 

CTTAAAAGAC TGGTAAAACC AAGTGTTGGT AAGGCAAGAG GAGCA 22 5 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 175 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 i nea r 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

GCTCCTCTTG CCTTACCAAC ACATTCTCAA A7VACCTGTTA GAGTCCTAAG CATTCTCCTG 60 
TTAGTATTGG GATTTTACCC CTGTCCTATA AAGATGTTAT GTACCAAAAA TGAAGTGGAG 12 0 

GGCCATACCC TGAGGGAGGG GAGGGATCTC TAGTGTTGTC AGAAGCGGAA GCTCA 17 5 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 3 base pairs 
is) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
AGCCATTTAC CACCCATGGA TGAATGGATT TTGTAATTCT AGCTGTTGTA TTTTGTGAAT 6 0 
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TTGTTAATTT TGTTGTTTTT CTGTGAAACA CATACATTGG ATATGGGAGG TAAAGGAGTG 
TCCCAGTTGC TCCTGGTCAC TCCCTTTATA GCCATTACTG TCTTGTTTCT TGTAACTCAG 
GTTAGGTTTT GGTCTCTCTT GCTCCACTGC AAAAAAAAAA AAA 

(2) INFORMATION FOR SEQ ID MO; 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNEGS : Single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

GTTCGAAGGT G.^^.CGTGTAG GTAGCGGATC TCAC.AACTGG GGAACTGTCA AAGACGAATT 

AACTGACTTG GATCAATCAA ATGTGACTGA GGAAACACCT GAAGGTGAAG AACATCATCC 

AGTGGCAGAC ACTGAAAATA AGGAGAATGA AGTTGAAGAG GTAAAAGAGG AGGGTCCAAA 

AGAGATGACT TTGGATGGGT GGTAAATGGC T 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 08 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single. 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

GCTCCTCTTG CCTTACCAAC TTTGCACCCA TCATCAACCA TGTGGCCAGG TTTGCAGCCC 6 0 

AGGCTGCACA TCAGGGGACT GCCTCGCAAT ACTTCATGCT GTTGCTGCTG ACTGATGGTG 12 0 

CTGTGACGGA TGTGGAAGCC ACACGTGAGG CTGTGGTGCG TGCCTCGAAC CTGCCCATGT 180 

CAGTGATCAT TATGGGTGGT AAATGGCT 2 08 

(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

AGCCATTTAC CACCCATACT AAATTCTAGT TCAAACTCCA ACTTCTTCCA TAAAACATCT 6 0 

AACCACTGAC ACCAGTTGGC AATAGCTTCT TCCTTCTTTA ACCTCTTAGA: GTATTTATGG 12 0 

TCAATGCCAC ACATTTCTGC AACTGAATAA AGTTGGTAAG GCAAGAGGAG C 171 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 base pairs 
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(B) TYPE: nucleic acid 

(C) STRAIJDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

CGGGTGATGC CTCCTCAGGC TTTGGTGTGT CCACTCNACT CACTGGCCTC TTCTCCAGCA 
ACTGGTGAAN ATGTCCTCAN GAAAANCNCC ACACGCNGCT CAGGGTGGGG TGGGAANCAT 
CANAATCATC NGGC 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

AGAGGGTACA TATGCAACAG TATATAAAGG AAGAAGTGCA CTGAGAGGAA CTTCATCAAG 
GCCATTTAA.T CAATAAGTGA TAGAGTCAAG GCTCAACCCA GGTGTGACGG ATTCCAGGTC 
CCAAGCTCCT TACTGGTACC CTCTT 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 297 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



ix?) SEQUENCE DESCRIPTION: SEQ 3D NO:63: 

TGCACTGAGA GGAATTCAAA GGGTTTATGC CAAAGAACAA ACCAGTCCTC TGCAGCCTAA 
CTCATTTGTT TTTGGGCTGC GAAGCCATGT AGAGGGCGAT CAGGCAGTAG ATGGTCCCTC 
CCACAGTCAG CGCCATGGTG GTCCGGTAAA GCATTTGGTC AGGCAGGCCT CGTTTCAGGT 
AGACGGGCAC ACATCAGCTT TCTGGAAAAA CTTTTGTAGC TCTGGAGCTT TGTTTTTCCC 
AGCATAATCA TACACTGTGG AATCGGAGGT CAGTTTAGTT GGTAAGGCAA GAGGAGC 

(2) INFORMATION FOR SEQ ID NO:64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 00 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
GCACTGAGAG GAACTTCCAA TACTATGTTG AATAGGAGTG GTGAGAGAGG GCATCCTTGT 
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CTTGTGCCGG TTTTCAAAGG GAATGCTTCC AGCTTTTGCC CATTCAGTAT AATATTAAAG 
AATGTTTTAC CATTTTCTGT CTTGCCTGTT TTTCTGTGTT TTTGTTGGTC TCTTCATTCT 
CCATTTTTAG GCCTTTACAT GTTAGGAATA TATTTCTTTT AATGATACTT CACCTTTGGT 
ATCTTTTGTG AGACTCTACT CATAGTGTGA TAAGCACTGG GTTGGTAAGG CAAGAGGAGC 



120 



240 
300 



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 203 base pairs 

(B) TYPE: nucleic acid 

(C) STPJ^NDEDNESS : single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

GCTCCTCTTG CCTTACCAAC TCACCCAGTA TGTCAGCAAT TTTATCRGCT TTACCTACGA 60 
AACAGCCTGT ATCCAAACAC TTAACACACT CACCTGAAAA GTTCAGGCAA CAATCGcS 
CTCATGGGTC TCTCTGCTCC AGTTCTGAAC CTTTCTCTTT TCCTAGAACA TGCATT^SJ 180 
TCGATAGAAG TTCCTCTCAG TGC ^^m^Ki^ 180 

203 

(2) INFORMATION FOR SEQ ID NO: 66: ' 

(i) SEQUENCE CHARACTERISTICS: - 

(A) LENGTH: 344 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQIDNO:66: 

TACGGGGACC CCTGCATTGA GAAAGCGAGA CTCACTCTGA AGCTGAAATG CTGTTGCCCT 
TGCAGTGCTG GTAGCAGGAG TTCTGTGCTT TGTGGGCTAA GGCTCCTGGA TGACCCCTGA 
CATGGAGAAG GCAGAGTTGT GTGCCCCTTC TCATGGCCTC GTCAAGGCAT CATGGACTGC 
CACACACAAA ATGCCGTTTT TATTAACGAC ATGAAATTGA AGGAGAGAAC ACAATTCACT 
S™^'''''^'' GTAACCATGG ATATGGTCAC ATACAGAGGT GTGATTATGT AAAGGTTAAT 
TCCACCCACC TCATGTGGAA ACTAGCCTCA ATGCAGGGGT CCCA 

(2) INFORMATION, FOR SEQ ID NO: 67: 



60 
120 
180 
240 
300 
344 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

Sr^S^^^ GAACTTCGTA GGGAGGTTGA ACTGGCTGCT GAGGAGGGGG AACAACAGGG 
TAACCAGACT GATAGCCATT GGATGGATAA TATGGTGGTT GAGGAGGGAC ACTACTTATA 
GCAGAGGGTT GTGTATAGCC TGAGGAGGCA TCACCCG ^^^^^^^^^^ ACTACTTATA 



60 
120 
157 
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(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

GCACTGAGAG GAACTTCTAG AAAGTGAAAG TCTAGACATA AAATAAAATA AAAATTTAAA 6 0 

ACTCAGGAGA GACAGCCCAG CACGGTGCCT CACGCCTGTA ATCCCAGAAC TTTGGGAGCC 12 0 

TGAGGAGGCA TCACCCG 13 7 

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHAPJ^CTER I ST I CS : 

(A) LENGTH: 137 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCi^: DESCRIPTION: SEQ ID NO: 69: 

CGGGTGATGC CTCCTCAGGC TGTATTTTGA AGACTATCGA CTGGACTTCT TATCAACTGA 6 0 

AGAATCCGTT AAAAATACCA GTTGTATTAT TTCTACCTGT CAAAATCCAT TTCAAATGTT 12 0 

GAAGTTCCTC TCAGTGC 13 7 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 220 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

AGCATGTTGA GCCCAGACAC GC7UVTCTGAA TGAGTGTGCA CCTCAAGTAA ATGTCTACAC 6 0 

GCTGCCTGGT CTGACATGGC ACACCATCNC GTGGAGGGCA CASCTCTGCT CNGCCTACWA 12 0 

CGAGGGCANT CTCATWGACA GGTTCCACCC ACCAAACTGC AAGAGGCTCA NNAAGTACTR 18 0 

CCAGGGTMYA SGGACMASGG TGGGAYTYCA YCACWCATCT 22 0 

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 353 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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ixi) SEQUENCE DESCRIPTION: SEQ ID NO:71: 

CGTTAGGGTC TCTATCCACT GCTAAACCAT ACACCTGGGT AAACAGGGAC CATTTAACAT 6 0 

TCCCAI^ICTAA ATATGCCAAG TGACTTCACA TGTTTATCTT AAAGATGTCC AAAACGCAAC 12 0 

TGATTTTCTC CCCTAAACCT GTGATGGTGG GATGATTAAN CCTGAGTGGT CTACAGCAAG 180 

TTAAGTGCAA GGTGCTAAAT GAANGTGACC TGAGATACAG CATCTACAAG GCACTACCTC -^4 0 

TCAACNCAGG GCAACTTTGC TTCTCANAGG GCATTTAGCA GTGTCTGAAG TAATTTCTGT 3 00 

ATTACAACTC ACGGGGCGGG GGGTGAATAT CTANTGGANA GNAGACCCTA ACG 3 53 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear " ' ' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO ; 72 : 

GCACTGAGAG GAACTTCCAA TACYATKATC AGAGTGAACA RGCARCCYAC AGAACAGGAG 6 0 

AAAATGTTYG CAATCTCTCC ATCTGACAAA AGGCTAATAT CCACAWTCTA AWAGGAACTT 12 0 

AAACAAATTT ATGAGAAAAG AACARACAAC CTCAWCAAAA AGTGGGTGAA GGAWATGCTS 180 

AAARGAAGAC ATYTATTCAG CCAGTAAACA YATGAAAAAA AGGCTCATSA TCACTGAWCA 24 0 

TTAGAGAAAT GCAAATCAAA ACCACAATGA GATACCATCT YAYRCCAGTT^ AGA2^^?GGTGA 3 00 

TCATTAAAAR STCAGGAAAC AACAGATGCT GGACAAGGTG TCA 343 

{2)- INFORMATION FOR SEQ ID NO:73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single ' ■ ' 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

GCACTGAGAG GAACTTCAGA GAGAGAGAGA GAGTTCCACC CTGTACTTGG GGAGAGAAAC 60 

AGAAGGTGAG AAAGTCTTTG GTTCTGAAGC AGCTTCTAAG ATCTTTTCAT TTGCTTCATT 12 0 

TCAAAGTTCC CATGCTGCCA AAGTGCCATC CTTTGGGGTA CTGTTTTCTG AGCTCCAGTG 180 

ATA.ACTCATT TATACAAGGG AGATACCCAG AAAAAAAGTG AGCAAATCTT AAAAAGGTGG 24 0 

CTTGAGTTCA GCCTTAAATA CCATCTTGAA ATGACACAGA GAAAGAAIvIGA TGTTGGGTGG 3 00 
GAGTGGATAG AGACCCTAAC G 



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHAJ^CTERISTICS : 

(A) LENGTH: 321 base pairs 

(B) TYPE: nucleic acid = 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



321 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
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(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 244 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7t: 

CGTTAGGGTC TCTATCCACT CCCACTACTG ATCAAACTCT ATTTATTTAA TTATTTTTAT 
CATACTTTAA GTTCTGGGAT ACACGTGCAG CATGCGCAGG TTTGTTGCAT AGGTATACAC 
TTGCCATGGT GGTTTGCTGC ACCCATCAGT CCATCATCTA CATTAGGTAT TTCTCCTAAT 
GCTATCCCTC CCCTAGCCCC TTACACCCCC AACAGGCTCT AGTGTGTGAA GTTCCTCTCA 
GTGC 

(2) INFORMATION FOR SEQ ID NO : 77 : 
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GCACTGAGAG GAACTTCAGA GAGAGAGAGA GAGTTCCACC CTGTACTTGG GGAGAGAAAC 

AGAAGGTGAG AAAGTCTTTG GTTCTGAAGC AGCTTCTAAG ATCTTTTCAT TTGCTTCATT 

TCAAAGTTCC CATGCTGCCA AAGTGCCATC CTTTGGGGTA CTGTTTTCTG AGCTCCAGTG 

ATAACTCATT TATACAAGGG AGATACCCAG AAAAAAAGTG AGCAAATCTT AAAAAGGTGG 

CTTGAGTTCA GYCTTAAATA CCATCTTGAA ATGAMACAGA GAAAGAAGGA TGTTGGGTGG 

GAGTGGATAG AGACCCTAAC G 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 75 : 

GCACTGAGAG GAACTTCCAC ATGCACTGAG AAATGCATGT TCACAAGGAC TGAAGTCTGG 6 0 

AACTCAGTTT CTCAGTTCCA ATCCTGATTC AGGTGTTTAC CAGCTACACA ACCTTAAGCA 12 0 

AGTCAGATAA CCTTAGCTTC CTCATATGCA A:^TG;^GAAT GAAAAGTACT CATCGCTGAA 180 

TTGTTTTGAG GATTAGAAAA ACATCTGGCA TGCAGTAG/^J^ ATTCAATTAG TATTCATTTT 240 

CATTCTTCTA AATTAAACAA ATAGGATTTT TAGTGGTGGA ACTTCAGACA CCAGAAATGG 3 00 
GAGTGGATAG AGACCCT 



317 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 254 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
CGTTAGGGTC TCTATCCACT GAAATCTGAA GCACAGGAGG AAGAGAAGCA GTYCTAGTGA 6 0 
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GATGGCAAGT TCWTTTACCA GACTCTTTAA CATTTYGTTT AGTTTTAACC TTTATTTATG 12 0 

GATAATAAAG GTTAATATTA ATAATGATTT ATTTTAAGGC ATTCCCRAAT TTGCATAATT 18 0 

CTCCTTTTGG AGATACCCTT TTATCTCCAG TGCAAGTCTG GATCAAAGTG ATASAMAGAA 24 0 

GTTCCTCTCA GTGC 



(2) INFORMATION FOR SEQ ID NO: 78: 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 355 base pairs 

(B) TYPE: nucleic acid 

(C) STPJVNDEDNESS : single 

(D) TOPOLOGY: linear 



ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 



254 



TTCGATACAG GCAJKACATGA ACTGCAGGAG GGTGGTGACG ATCATGATGT- TGCCGATGGT 6 0 

CCGGATGGNC ACGAAGACGC ACTGGANCAC GTGCTTACGT CCTTTTGCtC TGTTGATGGC 12 0 

CCTGAGCGGA CGCAGGACCC TTATGACCCT CAGAATCTTC ACAACGGGAG ATGGCACTGG 180 

ATTGANTCCC ANTGACACCA GAGACACCCC AACCACCAGN ATATC.ANTAT ATTGATGTAG 24 0 

TTCCTGTAGA NGGCCCCCTT GTGGAGGAAA GCTCCATNAG TTCdTcAtCT TCAACAGGAT 3 00 

CTCAACAGTT TC^TGATGGCT GTGATGGGCA TAGTCATANT TAACCNTGTN TCGAA 3 55 

(2) ■ INFORiMATION FOR SEQ ID NO : 7 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 406 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear ' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

TAAGAGGGTA CCAGCAGAAA GGTTAGTATC ATCAGATAGC ATCTTATACG AGTAATATGC 6 0 

CTGCTATTTG AAGTGTAATT GAGAAGGAAA ATTTTAGCGT GCTCACTGAC CTGCCTGTAG 12 0 

CCCCAGTGAC AGCTAGGATG TGCATTCTCC AGCCATCAAG AGACTGAGTC AAGTTGTTCC 180 

TTAAGTCAGA ACAGCAGACT CAGCTCTGAC ATTCTGATTC GAATGACACT GTTCAGGAAT 24 0 

CGG AATCCTG TCGATTAGAC TGGACAGCTT GTGGCAAGTG AATTTGCCTG TAACAAGCCA 3 00 

GATTTTTTAA AATTTATATT GTAAATAATG TGTGTGTGTG TGTGTGTATA TATATATATA 360 

TGTACAGTTA TCTAAGTTAA TTTAAAAGTT GTTTGGTACC CTCTTA 4 06 

(2) INFORMATION FOR SEQ ID NO: 80: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 327 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

'^'TTTTTTTT TTTACTCGGC TCAGTCTAAT CCTTTTTGTA GTCACTCATA GGCCAGACTT 60 
AGGGCTAGGA TGATGATTAA TAAGAGGGAT GACATAACTA TTAGTGGCAG GTTAGTTGTT 12 0 
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TGTAGGGCTC ATGGTAGGGG T.AAAJVGGAGG GCAATTTCTA GATCAAATAA TAACAAGGTA 

ATAGCTACTA AGAAGAATTT TATGGAGAAA GGGACGCGGG CGGGGGATAT AGGGTCGAAG 

CCGCACTCGT AAGGGGTGGA TTTTTCTATG TAGCCGTTGA GTTGTGGTAG TCAAAATGTA 
ATAATTATTA GTAGTAAGCC TAGGAGA 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



_(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 81: 

TAGTCTATGC GGTTGATTCG GCAATCCATT ATTTGCTGGA TTTTGTCATG TGTTTTGCCA 

ATTGCATTCA TAATTTATTA TGCATTTATG CTTGTATCTC CTAAGTCATG GTATATAATC 

CATGCTTTTT ATGTTTTGTC TGACATAAAC TCTTATCAGA GCCCTTTGCA CACAGGGATT 

CAATAAATAT 7'AACACAGTC TACATTTATT TGGTGAATAT TGCATATCTG CTGTACTGAA 

AGCACATTAA GT.aACAAAGG CAAGTGAGAA GAATGAAAAG CACTACTCAC AACAGTTATC 
ATGATTGCGC ATAGACTA 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 338 base pairs 

(B) TYPE: nucleic acid 

(C) STRATJDEDNESS : single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

TCTTCAACCT CTACTCCCAC TAATAGCTTT TTGATGACTT CTAGCAAGCC TCGCTAACCT 
CGCCTTACCC CCCACTATTA ACCTACTGGG AGAACTCTCT GTGCTAGTAA CCACGTTCTC 
CTGATCAAAT ATCACTCTCC TACTTACAGG ACTCAACATA CTAGTCACAG CCCTATACTC 
CCTCTACATA TTTACCACAA CACAATGGGG CTCACTCACC CACCACATTA ACAACATAAA 
ACCCTCATTC ACACGAGAAA ACACCCTCAT GTTCATACAC CTATCCCCCA TTCTCCTCCT 
ATCCCTCAAC CCCGACATCA TTACCGGGTT TTCCTCTT 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 
(E) TYPE: nucleic acid 
(G) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

AGCCATTTAC CACCCATCCA CAAAAAAAAA AAAAAAAAAG AAAAATATCA AGGAATAAAA 60 
ATAGACTTTG AACAAAAAGG AACATTTGCT GGCCTGAGGA GGCATCACCC G 111 
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(2) INFORMATION FOR SEQ ID NO: 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 224 base pairs 

(B) TYPE: nucleic acid 

(C) GTRAITOEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:84: 

TCGGGTGATG CCTCCTCAGG CCA>\GAAGAT AAAGCTTCAG ACCCCTAACA CATTTCCAAA 

AAGGAAGAAA GGAGAAAAAA GGGCATCATC CCCGTTCCGA AGGGTCAGGG AGGAGGAAAT 

TGAGGTGGAT TCACGAGTTG CGGACAACTC CTTTGATGCC AAGCGAGGTG CAGCCGGAGA 

CTGGGGAGAG CGAGCCAATC AGGTTTTGAA GTTCCTCTCA GTGC 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

= (A) LENGTH: 348 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

GCACTGAGAG GAACTTCGTT GGAAACGGGT TTTTTTCATG TAAGGCTAGA CAGAAGAATT 6 0 

CTCAGTAACT TCCTTGTGTT GTGTGTATTC AACTCACASA GTTGAACGAT CCTTTACACA 12 0 

GAGCAGACTT GTAACACTCT TWTTGTGGAA TTTGCAAGTG GAGATTTCAG SCGCTTTGAA 180 

GTSAAAGGTA GAAAAGGAAA TATCTTCCTA TAAAAACTAG ACAGAATGAT TCTCAGAAAC 24 0 

TCCTTTGTGA TGTGTGCGTT CAACTCACAG AGTTTAACCT TTCWTTTCAT AGAAGCAGTT 3 00 

AGGAAACACT CTGTTTGTAA AGTCTGCAAG TGGATAGAGA CCCTAACG 34 8 

(2) INFORMATION FOR SEQ ID NO:86: ' 

(i) SEQUENCE CHARACTERISTICS: 
. .^A) LENGTH: 293 base pairs 
' (B) TYPE: nucleic- acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

GCACTGAGAG GAACTTCYTT GTGWTGTKTG YATTCAACTC ACAGAGTTGA ASSWTSMTTT 
ACABAGWKCA GGCTTKCAAA CACTCTTTTT GTMGAATYTG CAAGWGGAKA TTTSRRCCRC 
TTTGWGGYCW WYSKTMGAAW MGGRWATATC TTCWYATMRA AMCTAGACAG AAKSATTCTC 
AKAAWSTYYY YTGTGAWGWS TGCRTTCAAC TCACAGAGKT KAACMWTYCT KYTSATRGAG 
CAGTTWKGAA ACTCTMTTTC TTTGGATTCT GCAAGTGGAT AGAGACCCTA ACG 

(2) INFORMATION FOR SEQ ID NO: 87: 

( i ) . SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 10 base pairs 
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(B) TYPE: nucleic acid 

(C) STR/JJDEDNES3 : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
CTCCTAGGCT 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STPJUJDEDNESS : single 

(D) TOPOLOGY: linear 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 88 : 
AGTAGTTGCC 

(2) INFORr4ATION FOR SEQ ID NO : 8 9 : 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
TTCCGTTATG C 

(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 
TGGTAAAGGG 

(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 
TCGGTCATAG 

10 

(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH. 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



{xi} SEQUENCE DESCRIPTION: SEQ ID NO : 92 : 
TACAACGAGG 

10 

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
TGGATTGGTC 

10 

(2) INFORMATION FOR SEQ ID NO : 94 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 94 : 
CTTTCTACCC 

10 

(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 
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TTTTGGCTCC 

(2) INFORMATION FOR SEQ ID NO:96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(Xj) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
GGAACCAATC 

(2) INFORMATION FOR SEQ ID NO: 97: 

<i} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOrOLOGY; linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
TCGATACAGG 

(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:98: 

GGTACTAAGG 

(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS- single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
AGTCTATGCG 
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(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 
CTATCCATGG 10 
(2) INFORMATION FOR SEQ ID NO: 101: 

" ( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs, 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear • 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101- r 
TCTGTCCACA 10 
(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.:. 102 : 
AAGAGGGTAC 10 
(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO -103: 
CTTCAACCTC 10 
(2) INFORMATION FOR SEQ ID NO; 104: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRA^JDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 
GCTCCTCTTG CCTTACCAAC 2 0 

(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 
GTAAGTCGAG CAGTGTGATG 2 0 

(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
GTAAGTCGAG CAGTCTGATG 2 0 

(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
GACTTAGTGG AAAGAATGTA 2 0 

(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
GTAATTCCGC CAACCGTAGT 

(2) INFORMATION FOR SEQ ID NO: 109: 

(A; LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
ATGCTTGATC GATAGTGGAA 

(2) lNFOKf4ATI0N FOR SEQ ID rJO:110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : s i ng 1 e 
{D} TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID. NO:110: 
ACGGGGACCC CTGCATTGAG 2 0 

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 1 1 : 
TATTCTAGAC CATTCGCTAC 
(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 
ACATAACCAC TTTAGCGTTC 2 0 

(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

CGGGTGATGC CTCCTCAGGC 2 0 



(2) INFORMATION FOR SEQ ID NO: 114: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 
AGCATGTTGA GCCCAGACAC 2 0 

(2) INFORMATION FOR SEQ ID NOrllS: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
GACACCTTGT CCAGCATCTG 2 0 

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 
TACGCTGCAA CACTGTGGAG 20 
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(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 
CGTTAGGGTC TCTATCCACT 2 0 

(2) .INFORMATION FOR SEQ ID NO r 1 1 8 t 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 
AGACTGACTC ATGTCCCCTA 2 0 

(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs • ' 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single - ^ 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
TCATCGCTCG GTGACTCAAG 2 0 

(2) INFORMATION FOR SEQ ID NO: 12 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear - - 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 0: 
CAAGATTCCA TAGGCTGACC 2 0 

(2) INFORMATION FOR SEQ ID NO: 121: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairi^ 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
iD) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

ACGTACTGGT CTTGAAGGTC 2 0 

(2) INFORMATION FOR SEQ ID NO; 122: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(E) TYPE: nucleic acid 

(C) STRAITDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 
GACGCTTGGC CACTTGACAC 2 0 

(2) INFORr4ATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 
GTATCGACGT AGTGGTCTCC 2 0 

(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIJDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 
TAGTGACATT ACGACGCTGG 2 0 

(2) INFORMATION FOR SEQ ID NO: 12 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: 



SEQ ID NO : 12 5 : 



CGGGTGATGC CTCCTCAGGC 



20 



(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

ATGGCTATTT TCGGGCGCTG ACA 2 3 

(2) INFORMATION FOR SEQ ID NO: 12 7: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs. 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 
CCGGTATCTC CTCGTGGGTA TT 2 2 

(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 



(C) STPJUJDEDNESS : single 

(D) TOPOLOGY: linear 



(:<i) 



SEQUENCE DESCRIPTION: SEQ ID NO: 12 8: 



CTGCCTGAGC CACAAATG 



18 



(2) INFORMATION FOR SEQ ID NO: 12 9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 9: 
CCGGAGGAGG AAGCTAGAGG AATA 2 4 

(2) INFORMATION FOR SEQ ID NO: 13 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 0: 
TTTTTTTTTT TTAG 14 
(2) INFORMATIOII FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

Ser Ser Gly Gly Arg Thr Phe Asp Asp Phe His Arg Tyr Leu Leu Val 
15 10 15 

Gly lie 



(2) INFORMATION FOR SEQ ID NO: 13 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

Gin Gly Ala Ala Gin Lys Pro lie Asn Leu Ser Lys Xaa lie Glu Val 
15 10 15 

Val Gin Gly His Asp Glu 
20 

(2) INFORMATION FOR SEQ ID NO: 13 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 3: 

Ser Pro Gly Val Phe Leu Glu His Leu Gin Glu Ala Tyr Arg He Tyr 
i 5 10 15 

Thr Pro Phe Asp Leu Ser Ala 
2 0 

(2) INFORMATION FOR SEQ ID NO:134: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 134: 

Tyr Leu Leu Val Gly He Gin Gly Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 135: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

Gly Ala Ala Gin Lys Pro He Asn Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STR7VNDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 6: 

Asn Leu Ser Lys Xaa He Glu Val Val 
1 5 

(2) INFORMATION FOR SEQ ID NO: 13 7: 
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(l) sequence CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: ammo acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEOUENCE DESCRIPTION: SEQ ID NO: 137: 

Glu Val Val Gin Gly His Asp Glu Ser 
1 5 

(2) INFORMATIOrJ FOR SEQ ID NO: 13 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A^. LSNCTH: 9 amine acids 
(L-) TYPE: amino acid 
(''^) STRANDEDNESS: :jincj]e 
iV) TOPOLOGY: linear 



ixi) SEOL'ENCE DESCRIPTION: SEC ID NO: 13 8: 

His Le.^ Gin Glu Ala Tyr Arg He Tyr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEVUENCE CHARACTERISTICS: 

(A) LENGTH: 9 arrino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(L) TOPOLOGY: linear 



(xi) SEOUENCE DESCRIPTION: SEQ ID NO: 13 9: 

Asn Leo Ala Phe Val Ala Gin Ala A] a 
1 5 

(2) INFORMATION FOR SEQ ID NO: 14 0: 

(j) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 ammo acids 

(B) TYPE: amino acid 

(C) STPJiNDEDNESS : single 

(D) TOPOLOGY: linear 



(yii) SEQUENCE DESCRIPTION: SEQ ID NO. 14 0: 

Phe Val Ala Gin Ala Ala Pro Asp Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 141: 
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(i) SEQUENCE CHA.RACTERISTICS : 

(A) LENGTH: 9388 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS T single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141: 



GCTCGCGGCC 


GCGAGCTCPJ\ 


TTAACCCTCA 


CTAAAGGGAG 


TCGACTCGAT 


C^VCj .A. C T G T" T" A 


60 


CTGTGTCTAT 


GTAGAAAGAA 


GTAGACATAA 


GAGATTCCAT 


TTTGTTCTGT 


ACTAAGAAAA 


12 0 


ATTCTTCTGC 


CTTGAGATGC 


TGTTAATCTG 


TAACCCTAGC 


CCCAACCCTG 


TGCTCACAGA 


180 


GACATGTGCT 


GTGTTGACTC 


AAGGTTCAAT 


GGATTTAGGG 


CTATGCTTTG 


TTAAAAAAGT 


24 0 


GCTTGAAGAT 


AATATGCTTG 


TTAAAAGTCA 


TCAGCATTCT 


CTAATCTCAA 


GTACCCAGGG 


3 00 


ACACAATACA 


CTGCGGAAGG 


CCGCAGGGAC 


CTCTGTCTAG 


CAAAGCCAGG 


TATTGTCCAA 


3 6 0 


GATTTCTCCC 


CATGTGATAG 


CCTGAGATAT 


GGCCTCATGG 


GAAGGGTAAG 


ACCTGACTGT 


420 


CCCCCAGCCC 


GACATCCCCC 


AGCCCGACAT 


GCCCCAGCGC 


GACACCGGAA 


AAGGGTCTGT 


4 80 


GCTGAGGAGG 


ATTAGTAAAA 


GAGGAAGGCC 


TCTTTGCAGT 


TGAGGTAAGA 


GGAAGGCATC 


5 4 0 


TGTCTCCTGC 


TCGTCCCTGG 


GCAATAGAAT 


GTCTTGGTGT 


AAAACCCGAT 


TGTATGTTCT 


6 00 


ACTTACTGAG 


ATAGGAGAAA 


ACATCCTTAG 


GGCTGGAGGT 


GAGACACGCT 


GGCGGCAATA 


6 6 0 


CTGCTCTTTA 


ATGCACCGAG 


ATGTTTGTAT 


AAGTGCACAT 


CAAGGCACAG 


CACCTTTCCT 


72 0 


TAAACTTATT 


TATGACACAG 


AGACCTTTGT 


TCACGTTTTC 


CTGCTGACCC 


TCTCCCCACT 


7 80 


ATTACCCTAT 


TGGCCTGCCA 


CATCCCCCTC 


TCCGAGATGG 


TAGAGATAAT 


- GATCAATAAA 


84 0 


TACTGAGGGA 


ACTCAGAGAC 


CAGTGTCCCT 


GTAGGTCCTC 


CGTGTGCTGA 


GCGCCGGTCC 


900 


CTTGGGCTCA 


CTTTTCTTTC 


TCTATACTTT 


GTCTCTGTGT 


CTCTTTCTTT 


TCTCAGTCTC 


960 


TCGTTCCACC 


TGACGAGAAA 


TACCCACAGG 


TGTGGAGGGG 


CAGGCCACGC 


dTTCAATAAT 


1020 


TTACTAGCCT 


GTTCGCTGAC 


AACAAGACTG 


GTGGTGCAGA 


AGGTTGGGTC 


TTGGTGTTCA 


1080 


CCGGGTGGCA 


GGCATGGGCC 


AGGTGGGAGG 


GTCTCCAGCG 


CCTGGTGCAA 


ATCTCCAAGA 


1140 


AAGTGCAGGA 


AACAGCACCA 


AGGGTGATTG 


TAAATTTTGA 


TTTGGGGCGG 


CAGGTAGCCA 


1200 


TTCCAGCGCA 


AAAATGCGCA 


GGAAAGCTTT 


TGCTGTGCTT 


GTAGGCAGGT 


AGGCCCCAAG 


1260 


CACTTCTTAT 


TGGCTAATGT 


GGAGGGAACC 


TGCACATCCA 


TTGGCTGAAA 


TCTCCGTCTA 


1320 


TTTGAGGCTG 


ACTGAGCGCG 


TTCCTTTCTT 


CTGTGTTGCC 


tggaaacgga 


CTGTCTGCCT 


1380 


AGTAACATCT 


GATCACGTTT 


CCCATTGGCC 


GCCGTTTCCG 


GAAGCCCGCC 


CTCCCATTTC 


1440 


CGGAAGCCTG 


GCGCAAGGTT 


GGTCTGCAGG 


TGGCCTCCAG 


gtgcaaagtg 


GGAAGTGTGA 


1500 


GTCCTCAGTC 


TTGGGCTATT 


CGGCCACGTG 


CCTGCCGGAC 


atgggacgct 


GGAGGGTCAG 


1560 


CAGCGTGGAG 


TCCTGGCCTT 


TTGCGTCCAC 


GGGTGGGAAA 


ttggccattg 


CCACGGCGGG 


1620 


AACTGGGACT 


CAGGCTGCCC 


CCCGGCCGTT 


TCTCATCCGT 


ccaccggact 


CGTGGGCGCT 


1680 


CGCACTGGCG 


CTGATGTAGT 


TTCCTGACCT 


CTGACCCGTA 


ttgtctccag 


ATTAAAGGTA 


1740 


AAAACGGGGC 


TTTTTCAGCC 


CACTCGGGTA 


AAACGCCTTT 


tgatttctag 


GCAGGTGTTT 


1800 


TGTTGCACGC 


CTGGGAGGGA 


GTGACCCGCA 


GGTTGAGGTT 


tattaaaata 


CATTCCTGGT 


1860 


TTATGTTATG 


TTTATAATAA 


AGCACCCCAA 


CCTTTACAAA 


atctcacttt 


TTGCCAGTTG 


1920 


TATTATTTAG 


TGGACTGTCT 


CTGATAAGGA 


CAGCCAGTTA 


aaatggaatt 


TTGTTGTTGC 


1980 


TAATTAAACC 


AATTTTTAGT 


TTTGGTGTTT 


^GTCCTAATAG 


caacaacttc 


TCAGGCTTTA 


2040 


TAAAACCATA 


TTTCTTGGGG 


GAAATTTCTG 


TGTAAGGCAC 


agcgagttag 


TTTGGAATTG 


2100 


TTTTAAAGGA 


AGTAAGTTCC 


TGGTTTTGAT 


ATCTTAGTAG 


tgtaatgccg 


AACCTGGTTT 


2160 


TTACTAACCC 


TGTTTTTAGA 


CTCTCCCTTT 


CCTTAAATCA 


cctagccttg 


TTTCCACCTG 


2 2 2 0 


AATTGACTCT 


CCCTTAGCTA 


AGAGCGCCAG 


ATGGACTCCA 


tcttggctct 


TTCACTGGCA 


2280 


GCCCCTTCCT 


CAAGGACTTA 


ACTTGTGCAA 


GCTGACTCCC 


agcacatcca 


AGAATGCAAT 


2340 


TAACTGTTAA 


GATACTGTGG 


CAAGCTATAT 


CCGCAGTTCC 


gaggaattca 


■TCCGATTGAT 


2400 


TATGCCCAAA 


AGCCCCGCGT 


CTATCACCTT 


GTAATAATCT 


taaa.gcccct 


GCACCTGGAA 


2460 


CTATTAACTT 


TCCTGTAACC 


ATTTATCCTT 


TTAACTTTTT 


tgcttacttt 


ATTTCTGTAA 


2520 


AATTGTTTTA 


ACTAGACCTC 


CCCTCCCCTT 


TCTAAACCAA 


'agtataaaag 


AAGATCTAGC 


2580 


CCCTTCTTCA 


GAGCGGAGAG 


TU^TTTTGAGC 


ATTAGCCATC 


tcttggcggc 


CAGCTAAATA 


2640 


AATGGACTTT 


TAATTTGTCT 


CAAAGTGTGG 


CGTTTTCTCT 


aactcgctca 


GGTACGACAT 


2700 
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TTGGAGGCCC CAGCGAG/^ CGTCACCGGC AGAAACGTCA CCGGGCGAGA GCCGGGCCCG 2 7 60 

CTGTGTGCTC CCCCGGAAGG ACAGCCAGCT TGTAGGGGGG AGTGCCACCT GAAAAAAAAA 2 8 20 

TTTCCAGGTC CCCAAAGGGT GACCGTCTTC CGGAGGACAG CGGATCGACT ACCATGC3GG 2 8 80 

TGCCCACCAA A^^.TTCCACCT CTGAGTCCTC AACTGCTGAG CCCGGGGTCA GGTAGGTCAG 2 94 0 

ATTTGACTTT GGTTCTGGCA GAGGGAAGCG ACCCTGATGA GGGTGTCCCT CTTTTGACTC 3 0 00 

TGCGCATTTC TCTAGGATGC TAGAGGGTAG AGCCCTGGTT TTCTGTTAGA CGGCTCTGTG 3 060 

TCTCTGTCTG GGAGGGAAGT GGCCCTGACA GGGGCCATCC CTTGAGTCAG TCCACATCCC 312 0 

AGGATGCTGG GGGACTGAGT CCTGGTTTCT GGCAGACTGG TCTCTCTCTC TCTCTTTTTC 3180 

TATCTCTAAT CTTTCCTTGT TCAGGTTTCT TGGAGAATCT CTGGGAAAGA AAAAAGAAAA 3 24 0 

ACTGTTATAA ACTCTGTGTG AATGGTGAAT GAATGGGGGA GGACAAGGGC TTGCGCTTGT 3 3 00 

CCTCCAGTTr GTAGCTCCAC GGCGAAAGCT ACGGAGTTCA AGTGGGCCCT CACCTGCGGT 3 3 60 

TCCGTGGCGA CCTCATAAGG CTTAAGGCAG CATCCGGCAT AGCTCGATCC GAGCCGGGGG 3 4 20 

TTTATACCGO CCTGTCjf^J^TG CTAAGAGGAG CCCAAGTCCC CTAAGGGGGA GCGGCCAGGC 3 4 80 

GGGGATCTGA CTGATCCCAT CACGGGAGCC CCTCCCCTTG TTTGTCTAAA AAAAAAAAAA 3 54 0 

GAAGAAACTG TCATi^J^vCTGT TTACATGCCC TAGGGTC/,AC TGTTTGTTTT ATGTTTATTG 3 6 00 

TTCTGTTCGG TGTCTATTGT CTTGTTTAGT GGTTGTCAAG GTTTTGCATG TCAGGACGTC 3 660 

GATATTGCCC A/^.GACGTCTG GGTAAGAACT TCTGGAAGGT CCTTAGTGCT GATTTTTTGT 3 72 0 

CACAGGAGGT TAAATTTCTC ATCAATCATT TAGGCTGGCC ACGACAGTCC TGTCTTTTCT 3 7 80 

GCCACAAGCA AGTCAGGTGT TGTTACGGGA ATGAGTGTAA AAAAACATTC GCCTGATTGG 3 84 0 

GATTTCTGGC ACCATGATGG TTGTATTTAG ATTGTCATAC CCCACATCCA GGTTGATTGG 3 900 

ACCTCCTCTA AACTAAACTG GTGGTGGGTT CAAAACAGCC ACCCTGCAGA TTTCCTTGCT 3 96 0 

CACCTCTTTG Gl CATTCTGT AACTTTTCCT GTGCCCTTAA ATAGCACACT GTGTAGGGAJ^ 4 02 0 

ACCTACCCTC GTACTGCTTT ACTTCGTTTA GATTCTTACT CTGTTCCTCT GTGGCTACTC 4 0 80 

TCCCATCTTA AAAAGGATCC AAGTGGTCCT TTTCCTCCTC CCTGCCCCCT ACCGCACACA 4140 

TCTCGTTTTC CAGTGCGACA GCAAGTTCAG CGTCTCCAGG ACTTGGCTCT GCTCTCACTC 4 200 

CTTGAACCCT T/\AAAGAAAA AGCTGGGTTT GAGCTATTTG CCTTTGAGTC ATGGAGACAC 4 2 6 0 

A^lAAGGTATT TAGGGTACAG ATCTAGAAGA AGAGAGAGAA CACCTAGATC CAACTGACCC 4 3 2 0 

AGGAGATCTC GGGCTGGCCT CTAGTCCTCC TCCCTCAATC TTAAAGCTAC AGTGATGTGG 4 3 80 

CAAGTGGTAT TTAGCTGTTG TGGTTTTTCT GCTCTTTCTG GTCATGTTGA TTCTGTTCTT 4 4^0 

TCGATAGTCC AGCCCCCCAG GGAGTGAGTT TCTCTGTCTG TGCTGGGTTT GATATCTATG 4 50 0 

TTCAAATCTT ATTAAATTGC CTTCAAAAAA AAAAAAAAAA GGGAAACACT TCCTCCCAGC 4 5 60 

CTTGTAAGGG ITGGAGCCCT CTCCAGTATA TGCTGCAGAA TTTTTCTCTC GGTTTCTCAG 4 62 0 

AGGATTATGG AGTCCGCCTT AAAAAAGGCA AGCTCTGGAC ACTCTGCAAA GTAGAATGGC 4 680 

CAAAGTTTGG AGTTGAGTGG CCCCTTGAAG GGTCACTGAA CCTCACAATT GTTCAAGCTG 4 74 0 

TGTGGCGGGT TGTTACTGAA ACTCCCGGCC TCCCTGATCA GTTTCCCTAC ATTGATCAAT 4 800 

GGCTGAGTTT GGTCAGGAGC ACCCCTTCCA TGGCTCCACT CATGCACCAT TCATAATTTT 4 86 0 

ACCTCCAAGG TCCTCCTGAG CCAGACCGTG TTTTCGCCTC GACCCTCAGC CGGTTCAGCT 4 92 0 

CGCCCTGTAC TGCCTCTCTC TGAAGAAGAG GAGAGTCTCC CTCACCCAGT CCCACCGCCT 4 98 0 

TAAAACCAGC CTACTCCCTT AGGGTCATCC CATGTCTCCT CGGCTATGTC CCCTGTAGGC 504 0 

TCATCACCCA TTGCCTCTTG GTTGCAACCG TGGTGGGAGG AAGTAGCCCC TCTACTACCA 5100 

CTGAGAGAGG CACAAGTCCC TCTGGGTGAT GAGTGCTCCA CCCCCTTCCT GGTTTATGTC 516 0 

CCTTCTTTCT ACTTCTGACT TGTATAATTG GAAAACCCAT AATCCTCCCT TCTCTGAAAA 5220 

GCCCCAGGCT TTGACCTCAC TGATGGAGTC TGTACTCTGG ACACATTGGC CCACCTGGGA 52 8 0 

TGACTGTCAi^i. CAGCTCCTTT TGACCCTTTT CACCTCTGAA GAGAGGGAJajv GTATCCAAAG 5 34 0 

AGAGGCCAAA AAGTACAACC TCACATCAAC CAATAGGCCG GAGGAGGAAG CTAGAGGAAT 54 00 

AGTGATTAGA GACCC AATTG GGACCTAATT GGGACCCAAA TTTCTCAAGT GGAGGGAGAA 54 60 

CTTTTGACGA TTTCCACCGG TATCTCCTCG TGGGTATTCA GGGAGCTGCT CAGAAJ^CCTA 5 52 0 

TAAACTTGTC TAAGGCGACT GAAGTCGTCC AGGGGCATGA TGAGTCACCA GGAGTGTTTT 5 5 30 

TAGAGCACCT CCAGGAGGCT TATCGGATTT ACACCCCTTT TGACCTGGCA GCCCCCGAAA 5 64 0 

ATAGCCATGC TCTTAATTTG GCATTTGTGG CTCAGGCAGC CCCAGATAGT AAAAGGAAAC 5 7 00 

TCCAAAAACT AGAGGGATTT TGCTGGAATG AATACCAGTC AGCTTTTAGA GATAGCCTAA 5 7 60 

AAGGTTTTTG ACAGTCAAGA GGTTGAAAAA CAAAAACAAG CAGCTCAGGC AGCTGAAAAA 5 82 0 

AGCCACTGAT AAAGCATCCT GGAGTATCAG AGTTTACTGT TAGATCAGCC TCATTTGACT 5 880 

TCCCCTCCCA :aTGGTGTTT AAATCCAGCT ACACTACTTC CTGACTCAAA CTCCACTATT 5 94 0 

CCTGTTCATG ACTGTCAGGA ACTGTTGGAA ACTACTGAAA CTGGCCGACC TGATCTTCAA 6000 
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AATGTGCCCC TAGGAAAGGT GG ATGCCACC GTGTTCACAG ACAGTAGCAG CTTCCTCGAG 6 06 0 

AAGGGACTAC GAAAGGCCGG TGCAGCTGTT ACCATGGAGA CAGATGTGTT GTGGGCTCAG 6120 

GCTTTACCAG CAAACACCTC AGCACAAAAG GCTGAATTGA TCGCCCTCAC TCAGGCTCTC 6180 

CGATGGGGTA AGGATATTAA CGTTAACACT GACAGCAGGT ACGCCTTTGC TACTGTGCAT 624 0 

GTACGTGGAG CCATCTACCA GGAGCGTGGG CTACTCACCT CAGCAGGTGG CTGTAATCCA 6 3 00 

CTGTAAAGGA CATCAAAAGG AAAACACGGC TGTTGCCCGT GGTAACCAGA AAGCTGATTC 63 60 

AGCAGCTCAA GATGGAGTGT GACTTTCAGT CACGCCTCTA AACTTGCTGC CCACAGTCTC 64 2 0 

CTTTCCACAG CCAGATCTGC CTGACAATGC CGCATACTCA ACAGAAGAAG AAAACTGGCC 64 8 0 

TCAGAACTCA GAGCCAATAA AAATCAGGAA GGTTGGTGGA TTCTTCCTGA CTCTAGAATC 654 0 

TTCATACCCC GAACTCTTGG GAAAACTTTA ATCAGTCACC TACAGTCTAC CACCCATTTA 660 0 

GGAGGAGCAA AGCTACCTCA GCTCCTCCGG AGC'CGTTTTA AGATCCCCCA TCTT.CAAAGC 6 66 0 

CTAACAGATC AAGCAGCTCT CCGGTGCACA ACCTGCGCCC. AGGTAAATGC CAAAAAAGGT 6 72 0 

CCTAAACCCA GCCCAGGCCA CCGTCTCCAA GAAAACTCAC GAGGAGAAAA GTGGGAAATT 6780 

GACTTTACAG AAGTAAAACC ACACCGGGCT GGGTACAAAf ACCTTCTAGT ACTGGTAGAC 6 84 0 

ACCTTICTCTG GATGGACTGA AGdATTTGCT ACCAAAAACG AAACTGTCAJ^ TATGGTAGTT 6 90 0 

AAGTTTTTAC TCAATGAAAT CATCCCtCGA CGTGGGCTGC CTGTTGCCAT AGGGTCTGAT 6 96 0 

AATGGACCGG CCTTCGCCTT GTCTATAGTT TAGTCAGTCA GTAAGGCGTT A-AACATTCAA 7 02 0 

TGGAAGCTCC ATTGTGCCTA TCGACCCCAG AGCTCTGGGC AAGTAGAACG CA.TGAACTGC 70 8 0 

ACCCTAAAAA ACACTCTTAC AAAATTAATC TTAGAAACCG GTGTAAATTG TGTAAGTCTC 714 0 

CTTCCTTTAG CCCTACTTAG AGTAAGGTGC ACCCqTTACT GGGCTGGGTT CTTACCTTTT 72 0 0 

GAAATCATGT ATGGGAGGGC GCTGCCTATC TTGCCTAAGC TAAGAGATQC CCAATTGGCA 72 6 0 

AAAATATCAC AAACTAATTT ATTACAGTAC' CTa6aG.TCTC CCCAACAGG.T ACAAGATATC 7 32 0 

ATCCTGCCAC TTGTTCGAGG AACCCATcbC AATCCAATTC. CTGAACAGAC AGGGCCCTGC 73 8 0 

CATTCATTCC CGCCAGGTGA CCTGTTGTTT. GTTAAAAAGT "TCCAGAGAGA AGGACTCCCT 74 4 0 

CCTGCTTGGA AGAGACCTCA CACCGTCATC ACGATGCCAA CGGCTCTGAA GGTGGATGGC 75 00 

ATTCCTGCGT GGATTCATCA CTCCCGCATC AAAAAGGCCA ACGGAGCCCA ACTAGAAACA 756 0 

TGGGTCCCCA GGGCTGGGTC AGGCCCCTTA AAACtGCACC TAAGTTGGGT GAAGCCATTA 7 62 0 

GATTAATTCT TTTTCTTAAT TTTGTAAAAC AATGCATAGC TTCTGTCAAA CTTATGTATC 7 680 

TTAAGACTCA ATATAACCCC CTTGTTATAA CTGAGGAATC AATGATTTGA TTCCCCAAAA 7 74 0 

ACACAAGTGG GGAATGTAGT GTCCAACCTG GTTTTTACTA ACCCTGTTTT TAGACTCTCC 7 800 

CTTTCCTTTA ATCACTCAGC CTTGTTTCCA GCTGAATTGA CTCTCCCTTA GCTAAGAGCG 7 860 

CCAGATGGAC TCCATCTTGG CTCTTTCACT GGCAGCCGCT TCCTCAAGGA CTTAACTTGT 7 92 0 

GCAAGCTGAC TCCCAGCACA TCCAAGAATG CAATTAACfG ATAAGATACT GTGG.CAAGCT 7 98 0 

ATA'TCCGCAG TTCCCAGGAA TTCGTCCAAT TGATTACACC CAAAAGCCCC GCG'tcTATCA 8 04 0 

CCTTGTAATA ATCTTAAAGC CCCTGCACCT GGAACTATJA ACGTTCCTGT AACCATTTAT 810 0 

CCTTTTAACf TTTTTGCCTA CTTTATTTCT GTAAAATTGT TTTAACJAGA CCCCCCCTCT 816 0 

CCTTTCTAA-A CCAAAGTATA AAAGCAAATC TAGCCCCTTC TtCAGGCCGA GAGAATTTCG 82 2 0 

AGCGTTAGCC GTCTCTTGGC CACCAGCTAA ATAAACGGAT TCTTCATGTG TCXCAAAGTG 82 80 

TGGCGTTTTC TCTAACTCGC TCAGGTACGA CCGTGGTAGT ATTTTCCCCA ACGTCTTATT 834 0 

TTTAGGGCAC GTATGTAGAG TAACTTTTAT GAAAGAAACC AGTTAAGGAG GTTTTGGGAT 84 0 0 

TTCCTTTATC AACTGTAATA CTGGTTTTGA TTATTTATTT ATTTATTTAT TTTTTTTGAG 84 6 0 

AAGGAGTTTC ACTCTTGTTG CCCAGGCTGG AGTGCAATGG TGCGATCTTG GCTCACTGCA 8 52 0 

ACTTCCGCCT CCCAGGTTCA AGCGATTCTC CTGCCTCAGC CTCGAGAGTA GCTGGGATTA 8 58 0 

TAGGCATGCG CCACCACACC CAGCTAATTT TGTATTTTTA GTAAAGATGG GGTTTCTTCA 8 64 0 

TGTTGGTCAA GCTGGTCTGG AACTCCCCGC CTCGGGTGAT CTGCCCGCCT CGGCCTCCGA 8700 

AAGTGCTGGG ATTACAGGTG TGATCCACCA CACCCAGCCG. ATTTATATGT ATATAAATCA 8 76 0 

CATTCCTCTA ACCAAAATGT AGTGTTTCCT TCCAtCTTGA. AJATAGGCTG TAGACCCCGT 8 82 0 

GGGTATGGGA CATTGTTAAC • AGTGAGACCA CAGCAGTTTT TATGTCATCT. GACA.GCATCT 8880 

CCAAATAGCC TtCATGGTTG TCACTGCTTC CCAAGACAAT TCCAAATAAC ACTTCCCAGT 8 94 0 

GATGACTTGC TACTTGCTAT TGTTACTTAA TGTGTTAAGG TGGCTGTTAC AGACACTATT 9000 

AGTATGTCAG GAATTACACC AAAATTTAGT GGCTCAAACA ATCATTTTAT TATGTATGTG 906 0 

GAT*rCTCATG GTCAGGTCAG GATTTCAGAC AGGGCACAAG GGTAGCCcKc TTGTCTCTGT 912 0 

CTATGATGTC TGGCCTCAGC ACAGGAGACT CAACAGCTGG GGTCTGGGAC CATTTGGAGG 918 0 

CTTGTTCCCT CACATCTGAT ACCTGGCTTG GGATGTTGGA AGAGGQGGTG AGCTGAGACT 924 0 

GAGTGCCTAT ATGTAGTGTT TCCATATGGC CTTGACTTCC TTACAGCCTG GCAGCCTCAG 93 0 0 
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GGTAGTCAGA ATTCTTAGGA GGCACAGGGC TCCAGGGCAG ATGCTGAGGG GTCTTTTATG 93 60 

AGGTAGCACA GCAAATCCAC CCAGGATC 93 8 8 

(2) INFORMATION FOR SEQ ID NO: 14 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 419 base pairs 

(B) TYPE: nucleic acid 

(C) STRAiaDEDNESS : single 
(D> TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

TGTAAGTCGA GCAGTGTGAT GGAAGGAATG GTCTTTGGAG AGAGCATATC CATCTCCTCC 6 0 

TCACTGCCTC CTAATGTCAT GAGGTACACT GAGCAG7VATT AAACAGGGTA GTCTTAACCA 12 0 

CACTATTTTT AGCTACCTTG TCAAGCTAAT GGTTAAAGAJi^ CACTTTTGGT TTACACTTGT 180 

TGGGTCATAG AAGTTGCTTT CCGCCATCAC GCAATAAGTT TGTGTGTAAT CAGAAGGAGT 24 0 

TACCTTATGG TTTCAGTGTC ATTCTTTAGT TAACTTGGGA GCTGTGTAAT TTAGGCTTTG 3 00 

CGTATTATTT CACTTCTGTT CTCCACTTAT GAAGTGATTG TGTGTTCGCG TGTGTGTGCG 360 

TGCGCATGTG CTTCCGGCAG TTAACATAAG CAAATACCCA ACATCACACT GCTCGACTT 419 

(2) INFORMATION FOR SEQ ID NO: 14 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 02 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 3: 

TGTAAGTCGA GCAGTGTGAT GTCCACTGCA GTGTGTTGCT GGGAACAGTT AATGAGCAAA 6 0 

TTGTATACAA TGGCTAGTAC ATTGACCGGG ATTTGTTGAA GCTGGTGAGT GTTATGACTT 12 0 

AGCCTGTTAG ACTAGTCTAT GCACATGGCT CTGGTCAACT ACCGCTCTCT CATTTCTCCA 180 

GATAAATCCC CCATGCTTTA TATTCTCTT'C CAAACATACT ATCCTCATCA CCACATAGTT 24 0 

CCTTTGTTAA TGCTTTGTTC TAGACTTTCC CTTTTCTGTT TTCTTATTCA AACCTATATC 3 00 

TCTTTGCATA GATTGTAAAT TCAAATGCCC TCAGGGTGCA GGCAGTTCAT GTAAGGGAGG 3 60 

GAGGCTAGCC AGTGAGATCT GCATCACACT GCTCGACTTA CA 4 02 

(2) INFORMATION FOR SEQ ID NO: 14 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 224 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 

TCGGGTGATG CCTCCTCAGG CCAAGAAGAT AAAGCTTCAG ACCCCTAACA CATTTCCAAA 6 0 

AAGGAAGAAA GGAGAAAAAA GGGCATCATC CCCGTTCCGA AGGGTCAGGG AGGAGGAAAT 12 0 

TGAGGTGGAT TCACGAGTTG CGGACAACTC CTTTGATGCC AAGCGAGGTG CAGCCGGAGA 180 



wo 98/45328 



Pn7US98/06939 



S3 



CTGGGGAGAG CGAGCCAATC AGGTTTTGAA GTTCCTCTCA GTGC 224 
(2) INFORMATION FOR SEQ ID NO: 14 5: 

(i) SEOUEIJCE CHARACTERISTICS: 

(A) LENGTH: 111 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 145: 

AGCCATTTAC CACCCATCCA CAAAAAAAAA AAAAAAAAAG AAAAATATCA AGGAATAAAA 60 
ATAGACTTTG AACAAAAAGG AACATTTGCT GCCCTGAGGA GGCATCACCC G 111 

(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 585 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:.146: 



TAGCATGTTG AGCCCAGACA CTTGTAGAGA GAGGAGGACA GTTAGAAGAA GAAGAAAAGT 6 0 

TTTTAAATGC TGAAAGTTAC TATAAGAAAG CTTTGGCTTT GGATGAGACT TTTAJVAGATG 12 0 

CAGAGGATGC TTTGCAGAAA CTTCATAAAT ATATGCAGGT GATTCCTTAT TTCCTCCTAG ISO 

AAATTTAGTG ATATTTGAAA TAATGCCCAA ACTTAATTTT CTCCTGAGGA AAACTATTCT 24 0 

ACATTACTTA AGTAAGGCAT TATGAAAAGT TTCTTTTTAG GTATAGTTTT TCCTAATTGG 3 00 

GTTTGACATT GCTTCATAGT GCCTCTGTTT TTGTCCATAA TCGAAAGTAA AGATAGCTGT 360 

GAGAAAACTA TTACCTAAAT TTGGTATGTT ' GTTTTGAGAA ATGTCCTTAT AGGGAGCTCA 4 20 

CCTGGTGGTT TTTAAATTAT' TGTTGCTACT ATAATTGAGC TAATTATAAA AACCTTTTTG 4 80 

AGACATATTt TAAATTGTCT TTTCCTGTAA TACTGATGAT GATGTTTTCT CATGCATTTT 54 0 

CTTCTGAATT GGGACCATTG CTGCTGTGTC TGGGCTCACA TGCTA 5 85 



(2) INFORMATION FOR SEQ ID NO: 14 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 579 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 



TAGCATGTTG AGCCCAGACA CTGGGCAGCG GGGGTGGCCA CGGCAGCTCC TGCCGAGCCC 60 

AAGCGTGTTT GTCTGTGAAG GACCCTGACG TCACCTGCCA GGCTAGGGAG GGGTCAATGT 12 0 

GGAGTGAATG TTCACCGACT TTCGCAGGAG TGTGCAGAAG CCAGGTGCAA CTTGGTTTGC 180 

TTGTGTTCAT CACCCCTCAA GATATGCACA CTGCTTTCCA AATAAAGCAT CAACTGTCAT 24 0 

CTCCAGATGG GG AAGACTTT TTCTCCAACC AGCAGGCAGG TCCCCATCCA CTCAGACACC 3 00 

AGCACGTCCA CCTTCTCGGG CAGCACCACG TCCTCCACCT TCTGCtGGTA CACGGTGATG 3 60 
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ATGTCAGCAA AGCCGTTCTG GANGACCAGC TGGCCGGTGT GGTGTGCGAT CTCAGTGGCC 42 0 

TCCACCGCGT ACACGGCTCT AGGGGGCGCA TATJTGTGCAC AGAANAAATG ATGATCCAGT 4 80 

CGCACAGCCC AGGTCGAA.GA NGAGTTTATC CGTCAGGGAT TCTTTATTCT GCAGGATGAC 54 0 

CTGTGGTATT AATTGTTCGT GTGTGGGCTC AAGATGCTA 57 9 



(2) INFORMATION FOR SEQ ID NO: 14 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRAI>IDEDNESS : single 

(D) TOPOLOGY: linear 



ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 8: 



TGACACCTTG TCCAGCATCT GCAAGCCAGG AAGAGAGTCC TCACCAAGAT CCCCACCCCG 6 0 

TTGGCACCAG GATCTTGGAC TTCCAATCTC CAGAACTGTG AGAAATAAGT ATTTGTCGCT 12 0 

AAATAAATCT TTGTGGTTTC AGATATTTAG CTATAGCAGA TCAGGCTGAC TAAGAGAAAC 180 

CCCAT/iA.GAG TTACATACTC ATTAATCTCC GTCTCTATCC CCAGGTCTCA GATGCTGGAC 24 0 

AAGGTGTCA 24 9 



(2) INFORMATION FOR SEQ ID NO: 14 9; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 255 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 9: 



TGACACCTTG TCCAGCATCT GCTATTTTGT GACTTTTTAA TAATAGCCAT TCTGACTGGT 6 0 

GTGAGATGGT AACTCATTGT GGGTTTGGTC TGCATTTCTC TAATGATCAG TGATATTAAG 12 0 

CTTTTTTTAA ATATGCTTGT TGACCACATG TATATCATCT TTTGAGAAGT GTCTGTTCAT 180 

ATCCTTTGCC GACTTTTTAA TTTTTTTATC TTGTAAATTT GTTTAATTTC CTTACAGATG 24 0 

CTGGACAAGG TGTCA 2 55 



(2) INFORMATION FOR SEQ ID NO: 150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150: 



TTACGCTGCA ACACTGTGGA GGCCAAGCTG GGATCACTTC TTCATTCTAA CTGGAGAGGA 6 0 

GGGAAGTTCA AGTCCAGCAG AGGGTGGGTG GGTAGACAGT GGCACTCAGA AATGTCAGCT 120 

GGACCCCTGT CCCCGCATAG GCAGG>ACAGC AAGGCTGTGG CTCTCCAGGG CCAGCTGAAG 180 

AACAGGACAC TGTCTCCGCT GCCACAAAGC GTCAGAGACT CCCATCTTTG AAGCACGGCC 24 0 

TTCTTGGTCT TCCTGCACTT CCCTGTTCTG TTAGAGACCT GGTTATAGAC AAGGCTTCTC 3 00 
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CACAGTGTTG CAGCGTAA 3 18 

(2) INFORMATION FOR SEQ TD NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151: 



TNACGCNGCN ACNNTGTAGA GANGGNAAGG CNTTCCCCAC ATTNCCCCTT CATNANAGAA 6 0 

TTATTCNACC AAGNNT<5ACC NATGCCNTTT ATGACTTACA TGCNNACTNC NTAATCTGTN 12 0 

TCNNGCCTTA AAAGCNNNTC CACTACATGC NTCAtaCACTG TNTGTGTNAC NTCATNAACT 130 

GTCNGNAATA GGGGCNCATA ACTACAGAAA TGCANTTCAT ACTGCTTCCA NTGCCATCNG 24 0 

CGTGTGGCCT TNCCTACTCT TCTTNTATTC CA.AGTAGCAT CTCTGGANTG CTTCCCCACT 3 00 

CTCCACATTG TTGCAGCNAT AAT 32 3 



(2) INFORMATION FOR SEQ ID NO: 152: 

( i ) SEQUENCE CHARACTERISTICS : ■ 

(A) LENGTH: 311 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 



TCAAGATTCC ATAGGCTGAC CAGTCCAAGG AGAGTTGAAA TCATGAAGGA GAGTCTATCT 6 0 

GGAGAGAGCT GTAGTTTTGA GGGTTGCAAA GACTTAGGAT GGAGTTGGTG GGTGTGGTTA 12 0 

GTCTCTAAGG TTGATTTTGT TCATAAATTT CATGGCCTGA -ATGCCTTGCT TGCCTCACCC 18 0 

TGGTCCAAGC CTTAGTGAAC ACCTAAAAGT CTCTGTCTTC TTGCTCTCCA AACTTCTCCT 24 0 

GAGGATTTCC TCAGATTGTC TACATTCAGA TCGAAGCGAG TTGGCAAACA AGATGCAGTC 3 00 

cagagggtca g - ■ ; : " . 311 



(2) INFORMATION FOR SEQ ID NO: 153: 

{ i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 153: 



CAAGATTCCA TAGGCTGACC AGGAGGCTAT TCAAGATCTC TGGCAGTTGA GGAAGTCTCT 6 0 

TTAAGAAAAT AGTTTAAACA ATTTGTTAAA ATTTTTCTGT CTTACTl'CAT TTCTGTAGCA 12 0 

GTTGATATCT GGCTGTCCTT TTTATAATGC AGAGTGGGAA CTTTCCCtAC CATGTTTGAT 18 0 

AAATGTTGTC CAGGCTCCAT TGCCAATAAT GTGTTGTCCA AAATGCCTGT TTAGTTTTTA 24 0 

AAGACGGAAC TCCACCCTTT GCTTGGTCTT AAGTATGTAT GGAATGTTAT GATAGGACAT 3 00 

AGTAGTAGGG GTGGTCAGCC TATGGAATCT TG 33 2 
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(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 



TCAAGATTCC ATAGGCTGAC CTGGACAGAG ATCTCCTGGG TCTGGCCCAG GACAGCAGGC 6 0 

TCAAGCTCAG TGGAGAAGGT TTCCATGACC CTCAGATTCC CCCAAACCTT GGATTGGGTG 12 0 

ACATTGCATC TCCTCAGAGA GGGAGGAGAT GTANGTCTGG GCTTCCACAG GGACCTGGTA 180 

TTTTAGGATC AGGGTACCGC TGGCCTGAGG CTTGGATCAT TCANAGCCTG GGGGTGGAAT 24 0 

GGCTGGCAGC CTGTGGCCCC ATTGAAATAG GCTCTGGGGC ACTCCCTCTG TTCCTANTTG 3 00 

AACTTGGGTA AGGAACAGGA ATGTGGTCAN CCTATGGAAT CTTGA 34 5 



(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHAi?ACTERI STICS : 

(A) LENGTH: 295 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 



GACGCTTGGC CACTTGACAC ATTAAACAGT TTTGCATAAT CA.CTAi^CATG TATTTCTAGT 6 0 

TTGCTGTCTG CTGTGATGCC CTGCCCTGAT TCTCTGGCGT TAATGA.TGGC AAGCATAATC 12 0 

AAACGCTGTT CTGTTAATTC CAAGTTATAA CTGGCATTGA TTAAAGCATT ATCTTTCACA 18 0 

ACTAAACTGT TCTTCATANA ACAGCCCATA TTATTATCAA ATTAAGAGAC AATGTATTCC 24 0 

AATATCCTTT ANGGCCAATA TATTTNATGT CCCTTAATTA AGAGCTACTG TCCGT 2 95 



(2) INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 406 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156: 



GACGCTTGGC CACTTGACAC TGCAGTGGGA AAACCAGCAT GAGCCGCTGC CCCCAAGGAA 60 

CCTCGAAGCC CAGGCAGAGG ACCAGCCATC CCAGCCTGCA GGTAAAGTGT GTCACCTGTC 12 0 

AGGTGGGCTT GGGGTGAGTG GCTGGGGGA/^ GTGTGTGTGC AAAGGGGGTG TNAATG™TA 180 

TGCGTGTGAG CATGAGTGAT GGCTAGTGTG ACTGCATGTC AGGGA.GTGTG AACAAGCGTG 24 0 

CGGGGGTGTG TGTGCAAGTG CGTATGCATA TGAGAATATG TGTCTGTGGA TGAGTGCATT 30 0 

TGAAAGTCTG TGTGTGTGCG TGTGGTCATG ANGGTAANTT ANTGACTGCG CAGGATGTGT 3 60 

GAGTGTGCAT GGAACACTCA NTGTGTGTGT CAAGTGGCCN ANCGTC 4 06 
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(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

TGACGCTTGG CCACTTGACA CACTAAAGGG TGTTACTCAT CACTTTCTTC TCTCCTCGGT 6 0 

GGCATGTGAG TGCATCTATT CACTTGGCAC TCATTTGTTT GGCAGTGAqT GTAANCCANA 120 

TCTGATGCAT ACACCAGCTT GTAAATTGAA TAAATGTCTC TAATACTATG TGCTCACAAT 180 

ANGGTANGGG TGAGGAGAAG GGGAGAGA , - 2 08 

(2) INFORMATION FOR SEQ ID NO: 158: : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 547 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single. . 

(D) TOPOLOGY: linear : : • • . • 



(xi) SEQUENCE DESCRIPTION: SEQ ID. NO: 158: 

CTTCAACCTC CTTCAACCTC CTTCAACCTC CTGGATTCAA ACAATCATCC CACCTCAGAC 6 0 

TCCTTAGTAG CTGAGACTAC AGACTCACQC CACTACATCT GGCTAAATTT TTGTAGAGAT 12 0 

AGGGTTTCAT CATGTTGCCC TGGCTGGTCT CAAACTCCTG ACCTCAAGCA ATGTGCCCAC 180 

CTCAGCCTCC CAAAGTGCTG GGATTACAGG CATAAGCCAC CATGCCCAGT CCATNTTTAA 24 0 

TCTTTCCTAC 'CACATTCTTA. CCACACTTTC TTTTATGTTT AGATACATAA ATGCTTACCA 3 00 

TTATGATACA ATTGCCCACA GTATTAAGAC AGTAACATGC TGCACAGGTT TGTAGCCTAG 3 60 

GAACAGTAGG CAATACCACA TAGCTTAGGT GTGTGGTAGA CTATACCATC TAGGTTTGTG 42 0 

TAAGTTACAC TTTATGCTGT TTACACAATG ACAAAACCAT CTAATGATGC ATTTCTCAGA 4 80 

ATGTATCCTT GTCAGTAAGC TATGATGTAC AGGGAACACT GCCCAAGGAC ACAGATATTG 54 0 

TACCTGT 54 7 

(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 203 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 

GCTCCTCTTG CCTTACCAAC TCACCCAGTA TGTCAGCAAT TTTATCRGCT TTACCTACGA 60 

AACAGCCTGT ATCCAAACAC TTAACACACT CACCTGAAAA GTTCAGGCAA CAATCGCCTT 12 0 

CTCATGGGTC TCTCTGCTCC AGTTCTGAAC CTTTCTCTTT - TCGTAGAACA TGCATTTARG 180 

TCGATAGAAG TTCCTCTCAG TGC 2 03 

(2) INFORr4ATI0N FOR SEQ ID NO: 160: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 402 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 

TGTAAGTCGA GCAGTGTGAT GGGTGGAACA GGGTTGTAAG CAGTAATTGC AAACTGTATT 6 0 

TAAACAATAA TAATAATATT TAGCATTTAT AGAGCACTTT ATATCTTCAA AGTACTTGCA 12 0 

AACATTAYCT AATTAAATAC CCTCTCTGAT TATAATCTGG ATACAAATGC ACTTAAACTC 180 

AGGACAGGGT CATGAGARAA GTATGCATTT GA.^JVGTTGGT GCTAGCTATG CTTTAAAAAC 24 0 

CTATACAATG ATGGGRAAGT TAGAGTTCAG ATTCTGTTGG ACTGTTTTTG TGCATTTCAG 3 00 

TTCAGCCTGA TGGCAGAATT AGATCATATC TGCACTCGAT GACTYTGCTT GATAACTTAT 36 0 

CACTGAAATC TGAGTGTTGA TCATCACACT GCTCGACTTA CA 4 02 

(2) INFORMATION FOR SEQ ID NO: 161: 

{ 1 ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 193 base pairs 
(P) TYPE: nucleic acid 
(C) GTRA1«JDEDNESS : single 
{D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

AGCATGTTGA GCCCAGACAC TGACCAGGAG AAAAACCAAC CAATAGAAAC ACGCCCAGAC 6 0 

ACTGACCAGG AGAAAAACCA ACCAATAAAA ACAGGCCCGG ACATAAGACA AATAATAAAA 12 0 

TTAGCGGACA AGGACATGAA AACAGCTATT GTAAGAGCGG ATATAGTGGT GTGTGTCTGG 180 

GCTCAACATG CTA 193 

(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 

TGTTGAGCCC AGACACTGAC CAGGAGAAAA ACCAACCAAT AAAAACAGGC CCGGACATAA 6 0 

GACAAATAAT AAAATTAGCG GACAAGGACA TGAAAACAGC TATTGTAAGA GCGGATATAG 12 0 

TGGTGTGTGT CTGGGCTCAA CATGCTA 14 7 

(2) INFORMATION FOR SEQ ID NO:163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(xi) SEOUENCE DESCPIPTiON: SEQ ID NO: 16 3: 



TAGCATGTTG AGCCCAGACA CAAATCTTTC CTTAAGCAAT AAATCATTTC TGCATATGTT 6 0 

TTTAAAACCA CAGCTAAGCC ATGATTATTC AAAAGGACTA TTGTATTGGG TATTTTGATT 12 0 

TGGGTTCTTA TCTCCCTCAC ATTATCTTCA TTTCTATCAT TGACCTCTTA TCCCAGAGAC 18 0 

TCTCAAACTT TTATGTTATA CAAATGACAT TCTGTCTCAA AAAATATCTC ACGCACTTCT 24 0 

GTTCTGTTTC TGCGTGTGTA TGTGTGTGTG TGTGTGTCTG GGCTCAACAT GCTA 2 94 



(2) INFORMATION FOR SEQ ID NO: 164: 

(1) SEOUENCE CHARACTERISTICS-: 

(A) LENGTH: 412 base pairs 
(LM TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 



CGGGATTGGC TTTGAGCTGC AGATGCTGCC TGTGACCGCA CCCGGCGTGG AACAGAAAGC 6 0 

CACCTGGCTG CAAGTGCGCC AGAGCCGCCC TGACTACGTG CTGCTGTGGG GCTGGGGCGT 12 0 

GATGAACTCC ACCGCCCTGA AGGAAGCCCA GGCCACCGGA TACCCCCGCG ACAAGATGTA 180 

CGGCGTGTGG TGGGCCGGTG CGGAGCCCGA TGTGCGTGAC GTGGGCGAAG GCGCCAAGGG 24 0 

CTACAACGCG CTGGCTCTGA ACGGCTACGG CACGCAGTCC AAGGTGATCC AlsFGACATCCT 3 00 

GAAACACGTG CACGACAAGG GCCAGGGCAC GGGGCCCAAA GACGAAGTGG GCTCGGTGCT 36 0 

GTACACCCGC GGCGTGATCA TCCAGATGCT GGACAAGGTG TCAATCACTA AT 412 



(2) INFORMA.TION FOR SEQ ID NO: 165: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 361 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIvJDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165: 



TTGACACCTT GTCCAGCATC TGCATCTGAT GAGAGCCTCA GATGGCTACC ACTAATGGCA 6 0 

GAAGGCAAAG GAGAACAGGC ATTGTATGGC AAGAAAGGAA GAAAGAGAGA GGGGAGAAAG 12 0 

GTGCTAGGTT CTTTTCAACA ACCAGTTCTT GATGGAACTG AGAGTAAGAG CTCAAGGCCA 180 

GGTGTGGTGA CTCCAACCAG TAATCCCAAC ATTTTAGGAG GCTGAGGCAG GCAGATGTCT 24 0 

TGACCCCATG AGTTTGTGAC CAGCCTGAAC AACATCATGA GACTCCATCT CTACAATAAT 3 00 

TACAA.AA-Z^TT AATCAGGCAT TGTGGTATGC CCTGTAGTCC CAGATGCTGG ACAAGGTGTC 3 60 

A 361 



(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 427 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(XI ^ SEQUENCE DESCRIPTION: SEQ ID NO: 166: 



TWGACTGACT CATGTCCCCT ACACCCAACT ATCTTCTCCA GGTGGCCAGG CATGATAGAA 6 0 

TCTGATCCTG ACTTAGGGGA ATATTTTCTT TTTACTTCCC ATCTTGATTC CCTGCCGGTG 12 0 

AGTTTCCTGG TTCAGGGTAA GAAAGGAGCT CAGGCCAAAG TAATGAACAA ATCCATCCTC 180 

ACAGACGTAC AGAATAAGAG AACWTGGACW TAGCCAGCAG AACMCAAKTG AAAMCAGAAC 24 0 

MCTTAMCTAG GATRACAAMC MCRRARATAR KTGCYCMCMC WTATAATAGA AACCAAACTT 3 00 

GTATCTAATT AAATATTTAT CCACYGTCAG GGCATTAGTG GTTTTGATAA ATACGCTTTG 360 

GCTAGGATTC CTGAGGTTAG AATGGAARAA CAATTGCAMC GAGGGTAGGG GACATGAGTC 42 0 

AKTCTAA 427 



(2) INFORMATION FOR SEQ ID NO: 167: 

(j) SEQUENCE CfiAJ^CTEPISTICS : 

(A) LENGTH: 500 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s i ng I e 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID NG:167: 



AACGTCGCAT GCTCCCGGCC GCCATGGCCG CGGGATAGAC TGACTCATGT CCCCTA.AGAT 6 0 

AGAGGAGACA CCTGCTAGGT GTAAGGAGAA GATGGTTAGG TCTACGGAGG CTCCAGGGTG 12 0 

GGAGTAGTTC CCTGCTA/^jCG GAGGGTAGAC TGTTCAACCT GTTCCTGCTC CGGCCTCCAC 180 

TATAGCAGAT GCGAGCAGGA GTAGGAGAGA GGGAGGTAAG AGTCAGAAGC TTATGTTGTT 24 0 

TATGCGGGGA AACGCCRTAT CGGGGGCAGC CRAGTTATTA GGGGACANTR TAGWYARTCW 3 00 

AGNTAGCATC CAAAGCGNGG GAGTTNTCCC ATATGGTTGG ACCTGCAGGC GGCCGCATTA 3 60 

GTGATTAGCA TGTGAGCCCC AGACACGCAT AGCAACAAGG ACCTAAACTC AGATCCTGTG 42 0 

CTGATTACTT AACATGAATT ATTGTATTTA TTTAACAACT TTGAGTTATG AGGCATATTA 4 80 

TTAGGTCCAT ATTACCT3GA 500 



(2) INFORMATION FOR SEQ ID NO: 168: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 358 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO. 168: 



TTCATCGCTC GGTGACTCAA GCCTGTAATC CCAGAACTTT GGGAGGCCGA GGGGAGCAGA 60 

TCACCTGAGG TTGGGAGTTT GAGACCAGCC TGGCCAACAT GGTGACAACC CGTCTCTGCT 12 0 

AAAAATACAi\ AAATTAGCCA AGCATGGTGG CATGCACTTG TAATCCCAGC TACTCGGGAG 180 

GCTGAGGCAG GAGAATCACT TGAGGCCAGG AGGCAGAGGT TGCAGTGAGG CAGAGGTTGA 24 0 

GATCATGCCA CTGCACTCCA GCCTGGGCAA CAGAGTAAGA CTCCATCTCA AAAAAAAAAA 3 00 

AAAAAAAGAA TGATCAGAGC CACAAATACA GAAAACCTTG AGTCACCGAG CGATGAAA 358 
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(2) INFORMATION FOR SKQ ID NO: 16 9; 

(i) SEQUENCE CHARACTERISTICS: 

(A; LENGTH: 1265 base pairs 
{ B ) TYPE : nucleic acid 

( C ) STRANDEDNES S single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: S£Q ID NO: 169: 

TTCTGTCCAC ACCAATCTTA GAGCTCTGAA AGAATTTGTC TTTAAATATC TTTTAATAGT 60 

T^CATGTATT TTATGGACCA AATTGACATT TTCGACTATT TTTTCCCAAA AAAAGTCAGG 12 0 

TGAATTTCAG CACACTGAGT TGGGAATTTC TTATCCCAGA AGWCGGCACG AGCAATTTCA 18 0 

TATTTATTTA AGATTGATTC CATACTCCCT TTTCAAGGAG AATCCCTGCA GTCTCCTTAA 24 0 

AGGTAGAACA /^.ATACTTTCT ATTTTTTTTT CACCATTGTG GGATTGGACT TTAAGAGGTG 3 00 

ACTCTAAAAA ^J^CAGAGAAC AAATATGTCT CAGTTGTATT AAGCACGGAC CCATATTATC 3 60 

ATATTCACTT ^J^JKA^^^JkATGA TTTCCTGTGC ACCTTTTGGC AACTTCTCtT TTCAATGTAG 420 

GGAAAAACTT AGTCACCCTG AAAACCCACA AAATAAATAA AACTTG-TAGA TGTGGGCAGA 480 

ARGTTTGGGG GTGGACATTG TATGTGTTTA AATTAAACCC TGTATCACTG AGAAGCTGTT 54 0 

GTATGGGTCA GAGAAJiJVTGA ATGCTTAGAA GCTGTTCACA TCTTCAAGAG CAGAAGCAAA 60 0 

CCACATGTCT CAGCTATATT ATTATTTATT TTTTATGCAT AAAGTGAATC ATTTCTTCTG 660 

TATTAATTTC CAAAGGGTTT TACCCTCTAT TTAAATGCTT TGAAAAACAG TGCATTGACA 72 0 

ATGGGTTGAT ATTTTTCTTT AAAAGAAAAA TATAATTATG AAAGCCAAGA TA-ATCTGAAG 7 80 

CCTGTTTTAT TTTAAAA.CTT TTTATGTTCT GTGGTTGATG TTGTTTGTTT GTTTGTTTCT 84 0 

ATTTTGTTGG TTTTTTACTT TGTTTTTTGT TTTGTTTTGT TTTGGTTTDG CATACTACAT 90 0 

GCAGTTTCTT TAACCAATGT CTGTTTGGCT AATGTAA.TTA AAGTTGTTAA TTTATATGAG 96 0 

TGCATTTCAA CTATGTCA-AT GGTTTCTTAA TATTTATTGT GTAGAAGTAC TGGTAATTTT 102 0 

TTTATTTACA ATATGTTTAA AGAGATAACA GTTTGATATG TTTTCATGTG TTTATAGCAG 1080 

AAGTTATTTA TTTCTATGGC: ATTCCAGCGG ATATTTTGGT GTTtGCGAGG CATGCAGTCA 114 0 

ATATTTTGTA CAGTTAGTGG ACAGTATTCA GCAACGCCTG ATAGCTTCTT TGGCCTTATG 1200 

TTAAATAAAA AGACCTGTTT GGGATGTAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 12 6 0 
AAAAA. ■ 

(2) INFORMATION FOR SEQ ID NO: 170: ' 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 383 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 

TGTAAGTCGA GCAGTGTGAT GACGATATTC TTCTTATTAA TGTGGTAATT GAACAAATGA 
TCTGTGATAC TGATCCTGAG CTAGGAGGCG CTGTICAGTT ^ATGGGACTT CTTCGTACTC 
TAATTGATCC AGAGAJi^CATG CTGGCTACAA CTAATAAAAC CGAAAAAAGT GAATTTCTAA 
ATTTTTTCTA CAACCATTGT ATGCATGTTC TCACAGCACC ACTTTTGACC AATACTTCAG 
AAGACAAATG TGAAAAGGAT AATATAGTTG GATCAAACAA AAACAACACA ATTTGTCCCG 
ATAATTATCA" AACAGCACAG CTACTTGCCT TAATTTTAGA GTTACTCACA TTTTGTGTGG 
AACATCACAC TGCTCGACTT ACA 

(2) INFORMATION FOR SEQ ID NO: 171: 
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( i ) SEQUENCE CliARACTERISTICS : 

(A) LEriGTH : 383 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

TGGGCACCTT CAATATCGCA AGTTAAAAAT AATGTTGAGT TTATTATACT TTTGACCTGT 6 0 

TTAGCTCAAC AGGGTGAAGG CATGTAAAGA ATGTGGACTT CTGAGGAATT TTCTTTTAAA 12 0 

AAGAACATAA TGAAGTAACA TTTTAATTAC TCAAGGACTA CTTTTGGTTG AAGTTTATAA 180 

TCTAGATACC TCTACTTTTT GTTTTTGCTG TTCGACAGTT CACAAAGACC TTCAGCAATT 24 0 

TACAGGGTAA AATCGTTGA_A GTAGTGGAGG TGAAACTGAA ATTTAAAATT ATTCTGTAAA 3 00 

TACTAT-AGGG AAAGAGGCTG AGCTTAGAAT CTTTTGGTTG TTCATGTGTT CTGTGCTCTT 3 60 

ATCATCACAC TGCTCGACTT ACA 3 83 

(2) INFORMATION FOR SEQ ID NO: 17 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 699 base pairs 

(B) TYPE: nucleic acid 

(C) STP-ANDEDNEGS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 172: 



TCGGGTGATG 


CCTCCTCAGG 


CTTGTCGTTA 


GTGTACACAG 


AGCTGCTCA.T 


GAAGCGACAG 


60 


CGGCTGCCCC 


TGGCACTTCA 


GAACCTCTTC 


CTCTACACTT 


TTGGTGCGCT 


TCTGAATCTA 


120 


GGTCTGCATG 


CTGGCGGCGG 


CTCTGGCCCA 


GGCCTCCTGG 


AAAGTTTCTC 


AGGATGGGCA 


180 


GCACTCGTGG 


TGCTGAGCCA 


GGCACTAAAT 


GGACTGCTCA 


TGTCTGCTGT 


CATGGAGCAT 


240 


GGCAGCAGCA 


TCACACGCCT 


CTTTGTGGTG 


TCCTGCTCGC 


TGGTGGTCAA 


CGCCGTGCTC 


300 


TCAGCAGTCC 


TGCTACGGCT 


GCAGCTCACA 


GCCGCCTTCT 


TCCTGGCCAC 


ATTGCTCATT 


360 


GGCCTGGCCA 


TGCGCCTGTA 


CTATGGCAGC 


CGCTAGTCCC 


TGACAACTTC 


CACCCTGATT 


420 


CCGGACCCTG 


TAGATTGGGC 


GCCACCACCA 


GATCCCCCTC 


CCAGGCCTTC 


CTCCCTCTCC 


480 


CATCAGCGGC 


CCTGTAJVCAA 


GTGCCTTGTG 


AGAAAAGCTG 


GAGAAGTGAG 


GGCAGCCAGG 


540 


TTATTCTCTG 


GAGGTTGGTG 


GATGAAGGGG 


TACCCCTAGG 


AGATGTGAAG 


TGTGGGTTTG 


600 


GTTAAGGAAA 


TGCTTACCAT 


CCCCCACCCC 


CAACCAAGTT 


NTTCCAGACT 


AAAGAATTAA 


660 


GGTAACATCA 


ATACCTAGGC 


CTGAGGAGGC 


ATCACCCGA 






699 



(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHAJ^CTERISTICS : 

(A) LENGTH: 701 base pairs 

(B) TYPE: nucleic acid 
iC) STRANDEDNESS: single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 

TCGGGTGATG CCTCCTCAGG CCAGATCAAA CTTGGGGTTG 7UVAACTGTGC AAAGAAATCA 60 
ATGTCGGAGA AAGAATTTTG CAAAAGAAAA ATGCCTAATC AGTACTAATT TAATAGGTCA 12 0 



BNSDOCID <WO 9B4S3?eA? I > 



wo 98/45328 



PCT/IIS98/06939 



93 





nn A Art A A PA a 




1 J. iAitjlLAL) 


C I ATTTTATA 


ATCACCAGAG 


180 


i Vjk^ i. i /-iVJ 1 1 


PATPT A APPP 






AAGAACAATT 


TTATTCGTCG 


24 0 




TTP A AT'^T'ZiT 


APPRTPTTA A 




TTTTAAATTT 


TGATAAAGAA 


3 00 


AGCTCCACTT 


TTGGCAGGAG 


TAGGGGGCAG 


GGAGAGAGGA 


GGCTCCATCC 


ACAAGGACAG 


360 


AGACACCAGG 


GCCAGTAGGG 


TAGCTGGTGG 


CTGGATCAGT 


CACAACGGAC 


TGACTTATGC 


420 


CATGAGAAGA 


AACAACCTCC 


AAATCTCAGT 


TGCTTAATAC 


AACACAAGCT 


CATTTCTTGC 


480 


TCACGTTACA 


TGTCCTATGT 


AGATCAACAG 


CAGGTGACTC 


AGGGACCCAG 


GCTCCATCTC 


540 


CATATGAGCT 


TCCATAGTCA 


CCAGGACACG 


GGCTCTGAAA 


GTGTCCTCCA 


TGCAGGGACA 


600 


CATGCCTCTT 


CCTTTCATTG 


GGCAGAGCAA 


GTCACTTATG 


GCCAGAAGTC 


ACACTGCAGG 


660 


GCAGTGCCAT 


CCTGCTGTAT 


GCCTGAGGAG 


GCATCACCCG 


A. 




701 



(2) INFORMATION FOR SEQ ID NO: 174: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 700 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174: 



TCGGGTGATG 


CCTCCTCANG 


CCCCTAAATC 


AGAGTCCAGG 


GTCAGAGCCA 


CAGGAGACAG 


60 


GGAAAGACAT 


AGATTTTAAC 


CGGCCCCCTT 


CAGGAGATTC 


TGAGGCTCAG 


TTCACTTTGT 


120 


TGCAGTTTGA 


ACAGAGGCAG 


CAAGGCTAGT 


GGTTAGGGGC 


ACGGTCTCTA' 


AAGCTGCACT 


180 


GCCTGGATCT 


GCCTCCCAGC 


TCTGCCAGGA 


ACCAGCTGCG 


TGGCCTTGAG 


CTGCTGACAC 


240 


GCAGAAA-GCC 


CCCTGTGGAC 


CCAGTCTCCT 


CGTCTGTAAG 


ATGAGGACAG 


GACTCTAGGA 


300 


ACCCTTTCCC 


TTGGTTTGGC 


CTCACTTTCA 


CAGGCTCCCA 


TCTTGAACTC 


TATCTACTCT 


360 


TTTCCTGAAA 


CCTTGTAAAA 


GAAAAAAGTG 


CTAGCCTGGG 


CAACATGGCA 


AAACCCTGTC 


420 


TCTACAAAAA 


ATACAAAAAT 


TAGTTGGGTG 


TGGTGGCATG 


TGCCTGTAGT 


CCCAGCCACT 


480 


TGGGAGGTGC 


TGAGGTGGGA 


GGATCACTTG 'AGCCCGGGAG 


GTGGAGGTTG 


CAGTGAGCCA 


540 


AGATCATGCC 


actgcact6c 


AGCCTGAGTA 


-ATAGAGTAAG 


ACTCTGTCTC 


AAAAACAACA 


600 


ACAACAACAG 


TGAGTGTGCC 


TCTGTTTCCG 


GGTTGGATGG 


GGCACCACAT 


TTATGCATCT 


660 


CTCAGATTTG 


GACGCTGCAG 


CCTGAGGAGG 


CATCACCCGA 






700 



(2) INFORMATION FOR SEQ ID NO:175: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 484 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 

TATAGGGCGA ATTGGGCCCG AGTTGCATGN TCCCGGCCGC CATGGCCGCG GGATTCGGGT 6 0 

GATGCCTCCT CAGGCTTGTC TGCCACAAGC TACTTCtCTG AGCTCAGAAA GTGCCCCTTG 12 0 

ATGAGGGAAA ATGTCCTACT GCACTGCGAA TTTCTCAGTT CCATTTTACC TCCCAGTCCT 18 0 

CCTTCTAAAC CAGTTAATAA ATTCATTCCA CAAGTATTTA CTGATTACCT GCTTGTGCCA 24 0 

GGGACTATTC TCAGGCTGAA GAAGGTGGGA GGGGAGGGCG GAACCTGAGG AGCCACCTGA 300 

GCCAGCTTTA TATTTCAACC ATGGCTGGCC CATCTGAGAG CATCTCCCCA CTCTCGCCAA 3 60 

CCTATCGGGG CATAGCCCAG GGATGCCCCC AGGCGGCCCA GGTTAGATGC GTCCCTTTGG 420 

CTTGTCAGTG ATGACATACA CCTTAGCTGC TTAGCTGGTG CTGGCCTGAG GAGGCATCAC 4 80 

CCGA ' 484 
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(2) INFORMATION FOR SEQ ID NO : 1 7 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 

TCGGGTGATG CCTCCTCAGG GCTCAAGGGA TGAGAAGTGA CTTCTTTCTG GAGGGACCGT 60 

TCATGCCACC CAGGATGAAJ^. ATGGATAGGG ACCCACTTGG AGGACTTGCT GATATGTTTG 12 0 

GACAAATGCC AGGTAGCGGA ATTGGTACTG GTCCAGGAGT TATCCAGGAT AGATTTTCAC 130 

CCACCATGGG ACGTCATCGT TCAAATCAAC TCTTCAATGG CCATGGGGGA CACATCATGC 24 0 

CTCCCACACA ATCGCAGTTT GGAGAGATGG GAGGCAAGTT TATGAAAAGC CAGGGGCTAA 3 00 

GCCAGCTCTA CCATAACCAG AGTCAGGGAC TCTTATCCCA GCTGCAAGGA CAGTCGAAGG 3 60 

ATATGCCACC TCGGTTTTCT AAGAAAGGAC AGCTTAATGC AGATGAGATT AGCCTGAGGA 4 20 

GGCATCACCC GA 4 32 

(2) INFORMATION FOR SEQ ID NO: 177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 788 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 177 : 

TAGCATGTTG AGCCCAGACA CAGTAGCATT TGTGCCAATT TCTGGTTGGA ATGGTGACAA 60 

CATGCTGGAG CCAAGTGCTA ACATGCCTTG GTTCAAGGGA TGGAAAGTCA CCCGTAAGGA 120 

TGGCAATGCC AGTGGAACCA CGCTGCTTGA GGCTCTGGAC TGCATCCTAC CACCAACTCG 180 

CCCAACTGAC AAGCCCTTGC GCCTGCCTCT CCAGGATGTC TACAAAATTG GTGGTATTGG 24 0 

TACTGTTCCT GTTGGCCGAG TGGAGACTGG TGTTCTCAAA CCCGGTATGG TGGTCACCTT 3 00 

TGCTCCAGTC AACGTTACAJV CGGAAGTAAA ATCTGTCGAA ATGCACCATG AAGCTTTGAG 3 60 

TGAAGCTCTT CCTGGGGACA ATGTGGGCTT CAATGTCAAG AATGTGTCTG TCAAGGATGT 4 20 

TCGTCGTGGC AACGTTGCTG GTGACAGCAA AAATGACCCA CCAATGGAAG CAGCTGGCTT 4 80 

CACTGCTCAG GTGATTATCC TGAACCATCC AGGCCAAATA AGTGCCGGCT ATGCCCCTGT 54 0 

ATTGGATTGC CACACGGCTC ACATTGCATG CAAGTTTGCT GAGCTGAAGG AAAAGATTGA 6 00 

TCGCCGTTCT GGTAAAAAGC TGGAAGATGG CCCTAAATTC TTGAAGTCTG GTGATGCTGC 66 0 

CATTGTTGAT ATGGTTCCTG GCAAGCCCAT GTGTGTTGAG AGCTTCTCAG ACTATCCACC 72 0 

TTTGGGTCGC TTTGCTGTTC GTGATATGAG ACAGACAGTT GCGGTGGGTG TCTGGGCTCA 7 80 

ACATGCTA 7 8 8 

(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 786 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 



TAGCATGTTG 
CACTTGGGCT 
ATTGTTGTTC 
TATTTAATGA 
AGCCTACAAA 
GTTTCAGAGC 
CACTGTGCAC 
TACACACTTT 
GTGTGTTTAG 
GACACTTTCT 
GATTGTTAAA 
CCTTCTGTAG 
ATTGAACAGT 
ATGCTA 



AGCCCAGACA 
GCACTTTGAJ^ 
GTTTCTTTTC 
CAGAAACGTT 
AGGCACTTGT 



AGCCAGTGAT 
TCTTCTAGGT 
ATATATATGT 
TATTTTAGCT 
TGTTGACACC 
GCTTAATTAC 
TGTCGTAGGG 
ATGGCCTCAT 



TGACACACTT 
ATTAAAGGTT 
GGTACATTTC 



TATAAATGAA 



TGTTCCAGTC 
GTAAGGGTAT 
ATGTATGTAT 
TAGTGCAACT 
TTGAATATTA 
TATGATTTGT 
CTCTTACAGG 
TATATATTTT 



CCTGTGTTTC 



TGGGAGCTCT GGCAGTGGCG GATTCATAGG 

GGCTTTATTA GATTCACTAG TTTTTAAAAA 

TAATCAGACA GATCAGACAG CATAATTTTG 

TTCATGAATG AGCTTGCATT CTGAAGCAAG 

AGTTCTGGCT CTAGAGGCCA GTACTCTGGA 

AGTGATGCCT AGTTATATAG AGGAGGAGTA 

GCAACTTTGG ATCTTAAAAT TCTGTACACA 

GAAAACATCA AATTAGTTTG TCAAATATGT 

ATTTCCACAT TATTTATTAA ATTGATCTAA 

ATGTTCAAGG GTGCAATGTG TATTCCTTTA 

AGTAAATTAA CTTTTAAAAT GTATTTGAGC 

GTGGGAAAGA TTTTAJ\TTTT, CCAGTTGCTA 

GATTTATAGG AGTTTGTGTC TGGGCTCAAC 



786 



7 30 



180 
240 
3 00 
360 
420 
480 
540 
600 
660 



720 



120 



60 



(2) INFORMATION FOR SEQ ID NO: 17 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 base pairs 
(E) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179: 



TAGCATGTTG 


AGCCCAGACA 


CTGGTTACAA 


GACCAGACCT 


GCTTCCTCCA 


TATGTAAACA 


60 


GCTTTTAAAA 


AGCCAGTGAA 


CCTTTTTAAT 


.ACTTTGGCAA 


CCTTCTTTCA 


CAGGCAAAGA 


120 


ACACCCCCAT 


CCGCCCCTTG 


TTTGGAGTGC 


AGAGTTTGGC 


TTTGGTTCTT 


TGCCTTGCCT 


180 


GGAGTATACT 


TCTAATTCCT 


GTTGTCCTGC 


ACAAGCTGAA 


TACCGAGCTA 


CCCACCGCCA 


240 


CCCAGGCCAG 


GTTTCCACTC 


ATTTATTACT 


TTATGTTTCT 


GTTCCATTGC 


TGGTCCACAG 


300 


AAATAAGTTT 


TCCTTTGGAG 


GAATGTGATT 


ATACCCCTTT 


AATTTCCTCC 


-p'P'P'PQQ'P'PTT 


360 


TTTTAATATC 


A.TTGGTATGT 


GTTTGGCCCA 


GAGGAAACTG 


AAATTCACCA 


tCATCTTGAC 


420 


TGGCAATCCC 


ATTACCATGC 




AAACGTAATT 


Tl'TCTTGCCT 


TACATTGGCA 


480 


GAGTAGCCCT 


TCCTGGCTAC 


TGGCTTAATG 


TAGTCACTCA 


GTTTCTAGGT 


GGCATTAGGC 


540 


ATGAGACCTG 


AAGCACAGAC 


TGTCTTACCA 


CAAAAGGTGA 


CAAGATCTCA 


AACCTTAGCC 


600 


AAAGGGCTAT 


GTCAGGTTTC 


AATGCTATCT 


GCTTCTGTTC 


CTGCtCACTG 


TTCTGGATTT 


660 


TGTCCTTCTT 


CATCCCTAGC 


ACCAGAATTT 


CCCAGTCTCC 


CTCCCTACCT 


TCCCTTGTTT 


720 


TAATTCTAAT 


CTATCAGCAA 


AATAACTTTT 


CAAATGTTTT 


AACCGGTATC 


TCCATGTGTC 


780 


TGGGCTCAAC 


ATGCTA 
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(2) INFORMATION FOR SEQ ID NO: 180: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 488 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 
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GGATGTGCTG CAAGGCGATT AAGTTGGGTA ACGGCAGGGT TTTCCCAGTC ACGACGTTGT 6 0 

AAAACGACGG CCAGTGAATT GTAATACGAC TCACTATAGG GCGAATTGGG CGCGACGTCG 12 0 

CATGCTCCCG GCCGCCATGG CCGCGGGATA GCATGTTGAG GCGAGACACC TGCAGGTGAT 180 

TTGGAGAGAT TTTTCAGGTT AGCAGCTTGA TGGTCTTTTT CAGGAGGAGA GACACTGAGC 24 0 

ACTCCCAAGG TGAGGTTGAA GATTTCCTCT AGATAGCCGG ATAAGAAGAC TAGGAGGGAT 3 00 

GCCTAGAAAA TGATTAGCAT GCAAATTTCT ACCTGCCATT TCAGAACTGT GTGTCAGCCC 3 60 

ACATTCAGCT GCTTCTTGTG AACTGAAAAG AGAGAGGTAT TGAGACTTTT CTGATGGCCG 42 0 

CTCTAACATT GTAACACAGT AATCTGTGTG TGTGTGGGTG TGTGTGTGTG TCTGGGCTCA 4 80 

ACATGCTA 4 8 8 

(2) INFORMATION FOR SEQ ID NO: 181: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 

TAGCATGTTG AGCCCAGACA CGGCGACGGT ACCTGATGAG TGGGGTGATG GCACCTGTGA 60 

AAAGGAGGPJ^ CGTCATCCCC CATGATATTG GGGACCCAGA TGATGAACCA TGGCTCCGCG 120 

TCAATGCATA TTTA.\TCCAT GATACTGCTO ATTGGAAGGA CCTG;>.ACCTG AAGTTTGTGC 180 

TGCAGGTTTA TCGGGACTAT TACCTCACOG GTGA.TCAAAA CTTCCTr.^AG GACATGTGGC 24 0 

CTGTGTGTCT AGTAAGGGAT GCACATGCAG TGGCCAGTGT GCCAGGGGTA TGGTTGGTGT 3 00 

CTGGGCTCAA CATGCTA 317 

(2) INFORMATION FOR SEQ ID NO: 182: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 507 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:182: 

TAGCATGTTG AGCCCAGACA CTGGCTGTTA GCCAAATCCT CTCTCAGCTG CTCCCTGTGG 6 0 

TTTGGTGACT CAGGATTACA GAGGCATCCT GTTTCAGGGA ACAAAAAGAT TTTAGCTGCC 12 0 

AGCAGAGAGC ACCACATACA TTAGAATGGT AAGGACTGCC ACCTCCTTCA AGAACAGGAG 180 

TGAGGGTGGT GGTGAATGGG AATGGAAGCC TGCATTCCCT GATGCATTTG TGCTCTCTCA 24 0 

AATCCTGTCT TAGTCTTAGG AAAGGAAGTA AAGTTTCAAG GACGGTTCCG J^JkCTGCTTTT 300 

TGTGTCTGGG CTCAACATGC TATCCCGCGG CCATGGCGGC CGGGAGCATG CGACGTCGGG 3 60 

CCCAATTCGC CCTATAGTGA GTCGTATTAC AATTCACTGG CCGTCGTTTT ACAACGTCGT 42 0 

GACTGGGAAA ACCCTGGCGT TACCCAACTT AATCGCCTTG CA.GCACATCC CCCTTTCCCA 4 80 

GCTGGCGTAA TAIJCGAAAAG GCCCGCA 507 

(2) INFORMATION FOR SEQ ID NO: 18 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ TD NO: 183: 

GATTTACGCT GCAACACTGT GGAGGTAGCC CTGGAGCAAG GCAGGCATGG ATGCTTCTGC 

AATCCCCAAA TGGAGCCTGG TATTTCAGCC AGGAATCTGA GCAGAGCncC CTCTAATTGT 

AGTTATTCTC TTTGTTCTTC A.Z.CCTTCCAA TAGCCTTGAG CTTCCAGGGQ 
AGTGTCGTTA ATCATTACAG CCTGGTCTCC ACAGTGTTGC AGCGTAA 

(2) INFORMATION FOR SEQ ID NO: 184: 



60 
120 

180 
227 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 225 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIJDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184: 

TTACGCTGCA ACACTGTGGA GCAGATTAAC ATCAGACTTT TCTATCAACA TGACTGGGGT 
TACTAAAAAG ACAACAAATC AATGGCTTCA / AAAGTCTAAG ' GAATAATTTC GATACTTCAA 
CTTTATAAAA CCTGACAAAA . CTAT.CAATCA.7AGCATAAAGA CAGATGAAGA ACATTTCCAG 
ATT-TTGGCCA ATCAGATATT TTACCTCCAC ' AGTGTTGCAG CGTAA ^. ' 

(2) INFORMATION FOR SEQ ID NO: 185: 



60 
120 

180 
225 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 597 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single ^ 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185: 

GGCCCGACGT CGCATGCTCC CGGCCGCCAT GGCCGCGGGA TTCGTTAGGG TCTCTATCCA 
CTGGGACCCA TAGGCTAGTC AGAGTATTTA GAGTTGAGTT CCTTTCTGCT TCCCAGAATT 
TGAAAGAAAA GGAGTGAGGT. GATAGAGCTG AGAGATCAGA TTTGCCTCTG AAGCCTGTTC 
AAGATGTATG TGCTCAGACC CCACCACTGG GGCCTGTGGG TGAGGTCCTG GGCATCTATT 
TGAATGAATT GCTGAAGGGG AGCACTATGC CAAGGAAGGG GAACCCATCC TGGCACTGGC 
ACAGGGGTCA CCTTATCCAG TGCTCAGTGC TTCTTTGCTG CTACCTGGTT TTCTCTCATA 
TGTGAGGGGC AGGTAAGAAG AAGTGCCCRG TGTTGTGCGA GTTTTAGAAC ATCTACCAGT 
AAGTGGGGAA GTTTCAGAAA GCAGCAGCTT TGTTTTGTGT ATTTTCACCT TCAGTTAGAA 
GAGGAAGGCT GTGAGATGAA TGTTAGTTGA GTGGAAAAGA CGGGTAAGCT TAGTGGATAG 
AGACCCTAAC GAATCACTAG . TGCGGCCGCC TTGCAGGTCG ACCATATGGG AGAGCTC 

(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 597 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



60 
120 
180 
240 
300 
360 
420 
480 
540 
597 
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(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 



GGCCCGAAGT 
CTACCTAAAA 
ACCCCAGAGG 
TATATCATTT 
TGAACCACTA 
AACATCAAGT 



TGCATGTTCC 
AATCCCAAAC 
CCTACAGATC 



TTACGAACGA 
GCAGTAAATA 
TCGGTAGCAG 
ATACATGTAG 
TCCTTTGCTG 
TCACTAGTGC 



TCCAAGA.TTT 



CGGCCGCCAT GGCCGCGGGA TTCGTTAGGG TCTCTATCCA 

ATATAACTGA ACTCCTCACA CCCAATTGGA CCAATCCATC 

CTCCTTTGAT ACATAAGAAA ATTTCCCCAA ACTACCTAAC 

GTTTTACCAA ATTTTGATGG CCTTTCTGAG CTTGTCAGTG 

TCGGATATTA ACTGCCCCTC ACCGTCCAGG TGTAGCTGGC 

TTCATTAAGT TTTCACCTAC TAAGGTGvCTT AAACACCCTA 

ATCTTTTGAT TTGTTTTTAT TTCCCATAAG GGTCCTGTTC 

TGTGAGCAGC TAGTCACTAT CGCATGACTT GGAGGGTGAT 

TTAAAGAACT CTTGTCCCAG CCTGTCAAAG TGGATAGAGA 

GGCCGCCTGC AGGTCGACCA TATGGGAGAG CTCCCAA 



300 
360 
420 
480 
540 
597 



240 



60 
120 
180 



GGGTGCCATG 



AAGGTCAATC 
AATAGAGGCC 



CCCTAACGAA 



(2) INFORMATION FOR SEQ ID NO: 187: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 324 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187: 

TCGTTAGGGT CTCTATCCAC TTGCAGGTiZ^A AATCCAATCC TGTGTATATC TTATAGTCTT 60 

CCATATGTAG TGGTTCAAGA GACTGCAGTT CCAGAAAGAC TAGCCGAGCC CATCCATGTC 12 0 

TTCCACTTAA CCCTGCTTTG GGTTACACAT CTTAA.CTTTT CTGTTCAAGT TTCTCTGTGT 180 

AGTTTATAGC ATGAGTATTG GGAWAATGCC CTGAAACCTG ACATGAGATC TGGGAAACAC 240 

AAACTTACTC AATAAGAATT TCTCCCATAT TTTTATGATG GAAAAATTTC ACATGCACAG 3 00 

AGGAGTGGAT AGAGACCCTA ACGA 3 24 

(2) INFORMATION FOR SEQ ID NO : 1 8 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 

GCGCGGGGAT TCGGGGTGAT ACCTCCTCAT GCCA/vAATAC AACGTNTAAT TTCACAACTT 6 0 

GCCTTCCAAT TTACGCATTT TCAATTTGCT CTCCCCATTT GTTGAGTCAC AACAAACACC 12 0 

ATTGCCCAGA AACATGTATT ACCTAACATG CACATACTCT TAAAACTACT CATCCCTT 17 8 

(2) INFORMATION FOR SEQ ID NO: 18 9: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 367 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 8 9 : 



TGACACCTTG TCCAGCATCT GACACAGTCT TGGCTCTTGG AAAATATTGG ATAAATGAAA 

ATGAATTTCT TTAGCAAGTG GTATAAGCTG AGAATATACG TATCACATAT CCTCATTCTA 

AGACACATTC AGTGTCCCTG AAATTAGAAT AGGACTTACA ATAAGTGTGT' TCACTTTCTC 

AATAGCTGTT ATTCAATTGA TGGTA<3GCCT TAAAAGTCAA AGAAATGAGA GGGCATGTGA 

AAAAAAGCTC AACATCACTG ATCATTAGAA AACTTCCATT CAAACCCCCA ATGAGATACC 

ATCTCATACC AGTCAGAATG GCTATTATTA AAAAGTCAAA AAATAACAGA TGCTGGACAA 
GGTGTCA 



360 
367 



180 
240 



300 



120 



60 



(2) INFORMATION FOR SEQ ID NO: 190: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 369 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:190: 

GACACCTTGT CCAGCATCTG ACAACGCTAA CAGCCTGAGG AGATCTTTAT TTATTTATTT 6 0 

AGTTTTTACT CTGGCTAGGC AGATGGTGGC TAAAACATTC ATTTACCC At ^ TTATTCATTT 12 0 

AATTGTTCCT GCAAGGCCTA TGGATAGAGT ATTGTCCAGC ACTGCTCTGG AAGCTAGGAG 180 

CATGGGGATG AACAAGAtAG- GCTACATGCT GTTCCCACAG AACTTCCACT TTAGTCTGGG 24 0 

AAACAGATGA TATATACAAA TATATAAATG AATTCAGGTA GTTTTAAGTA CGAAAAGAAT 3 00 

AAGAAAGCAG AGTCATGATT TANAATGCTG GAAACAGGGG CTATTGCTTG^ AGATATTGAA 3 60 

GGTGCCCAA ^ 3 69 

(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: ' -'^ ' ' 

(A) LENGTH: 369 base pairs 
(E) TYPE: nucleic acid 
{C) STRANDEDNESS: single ' 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 

TGACACCTTG TCCAGCATCT GCACAGGGAA AAGAAACTAT TATCAGAGTG AACAGGCAAC 6 0 

CTACAGAATG GGAGAAAATT TTTGCAATCT ATCCATCTGA CAAAGGGCTA ATATCCAGAA 120 

TCTACAAA-GA ACTTATACAA" ATTTACAAGA AACAAACAAA CAAACAACTC CTCAAAAAGT 180 

GGGTGAAGGA TGTGAACAGA CACTTCTCAA AAGAAGACAT TTATGGGGCC AACAAACATA 24 0 

TGAAAAAAAG - CTCATCATCA CTGGTCACTA GATAAATGGA AATCAAAACC ACAATGAGAT 3 00 

ACCATCTCAT TCCAGTTAGA ATGGCAATCA TTAAAAAGTC AGGAAACAAC AGATGCTGGA 3 60 

CAAGGTGTC 3 69 

(2) INFORMATION FOR SEQ ID NO: 192: ' ' v 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 449 base pairs 

(B) TYPE: nucleic acid 
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(C) STRAJJDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 

TGACGCTTGG CCACTTGACA CTTCATCTTT GCACAGAAAA ACTTCTTTAC AGATTTAATT 60 

CAAGACTGGT CTAGTGACAG TCCTCCAGAC ATTTTTTCAT TTGTTCCATA TACGTGGAAT 12 0 

TTTAAAATCA TGTTTCATCA GTTTGAAATG ATTTGGGCTG CTAATCAACA CAATTGGATC 180 

GACTGTTCTA CTAAACAACA GGAAAATGTG TATCTGGCAG CCTGTGGAGA AACACTAAAC 24 0 

ATTGATTTTT CTTTGCCTTT TACGGACTTT GTTCCAGCTA CATGTAATAC CAAGTTCTCT 3 00 

TTAAGAGGAG AAGATGTTGA TCTTCATTTG TTTCTACCAG ACTGCCACCC TAGTAAATAT 3 60 

TCTTTATTTA TGCTGGTAAA AAATTGCCAT CCAAATAAGA TGATTCATGA TACTGGTATT 4 20 

CCTGCTGAGT GTCAAGTGGC CAAGCGTCA 44 9 

(2) INFORMATION FOR SEQ ID NO: 193: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 

TGACGCTTGG CCACTTGACA CCAGGGATGT AKCAGTTGAJ\ TATAjATCCTG CA-ATTGTACA 60 

TATTGGCAAT TTCCCATCAA ACATTCTAGA AAGAGACAAC CAGGATTGCT AGGCCATAAA 12 0 

AGCTGCAATA -AATAACTGGT AATTGCAGTA ATCATTTCAG GCCAATTCAA TCCAGTTTGG 180 

CTCAGAGGTG CCTTTGGCTG AGA.GAAGAGG TGAGATATAA TGTGTTTTCT TGCAACTTCT 24 0 

TGGAAGAATA ACTCCACAAT AGTCTGAGGA CTAGATACAA ACCTATTTGC CATTAAAGCA 3 00 

CCAGAGTCTG TTAATTCCAG TACTGATAAG TGTTGGAGAT TAGACTCCAG TGTGTCAAGT 360 

GGCCAAGCGT CA 3 72 

(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 309 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 

TGACGCTTGG CCACTTGACA CTTATGTAGA ATCCATCGTG GGCTGATGCA AGCCCTTTAT 6 0 

TTAGGCTTAG TGTTGTGGGC ACCTTCAATA TCACACTAGA GACAAACGCC ACAAGATCTG 12 0 

CAGAAACATT CAGTTCTGAN CACTCGAATG GCAGGATAAC TTTTTGTGTT GTAATCCTTC 180 

ACATATACAA AAACAAACTC TGCANTCTCA CGTTACAAAA AAACGTACTG CTGTAAAATA 24 0 

TTAAGAAGGG GTAAAGGATA CCATCTATAA C7VAAGTAACT TACAACTAGT GTCAAGTGGC 3 00 

CAAGCGTCA ^ 0 ^ 

(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195: 

TGACGCTTGG CCACTTGACA CCCAATCTCG CACTTCATCC TCCCAGCAC^ TGATGAAGTA 
GGACTGCAAC TATCCCCACT TCCCAGATGA GGGGACCAAN GTACACATTA GGACCCGGAT 
GGGAGCACAG ATTTGTCCGA TCCCAGACTC CAAGCACTCA GCGTCACTCC AGGACAGCGG 
CTTTCAGATA AGGTCACAAA CATGAATGGC TCCGACAACC GGAGTCAGTC CGTGCTGAGT 
TAAGGCAATG GTGACACGGA TGCACGTGTN ACCTGTAATG GTTCATCGTA AGTGTCAAGT 
GGCCAAGCGT CA 

(2) INFORMATION FOR SEQ ID NO : 196 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 288 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 

TGTATCGACG TAGTGGTCTC CTCAGCCATG CAGAACTGTG ACTCAATTAA ACCTCTTTCC 

TTTATGAATT ACCCAATCTC GGGTAGTGTC TTTATAGT^AG TGTGAGAATG GACTAATACA 

AGTACATTtT ACTTAGTAAT AATAATAAAC AAATATATTA CATTTTTGTG TATTTACTAC 

ACCATATTTT TTATTGTTAT TGTAGTGTAC ACCTTCTAcf TATTAAAAGA AATAGGCCCG 

AGGCGGGCAG atcacgaggt caggagatgg agaccactac gtcgatac 
(2) information for seq id no: 197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 289 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197: 

TTGGGCACCT TCAATATCAT GACAGGTGAT GTGATAACCA AGAAGGCTAC TAAGTGATTA 
ATGGGTGGGT AATGTATACA GAGTAGGTAC ACTGGACAGA GGGGTAATTC ATAGCCAAGG 
CAGGAGAAGC AGAATGGCAA AACATTTCAT CACACTACTC AGGATAGCAT GCAGTTTAAA 
ACCTATAAGT AGTTTATTTT TGGAATTTTC CACTTAATAT TTTCAGACTG CAGGTAACTA 
AACTGTGGAA CACAAGAACA TAGATAAGGG GAGACCACTA CGTCGATAC 

(2) INFORMATION FOR SEQ ID NO:198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 288 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198: 

GTATCGACGT AGTGGTCTCC CAAGCAGTGG GAAGAAAACG TGAACCAATT AAAATGTATC 6 0 

AGATACCCCA AAGAAAGGCG CTTGAGTAAA GATTCCAAGT GGGTCACAAT CTCAGATCTT 12 0 

AAAATTCAGG CTGTCAAAGA GATTTGCTAT GAGGTTGCTC TCAATGACTT CAGGCACAGT 18 0 

CGGCAGGAGA TTGAAGCCCT GGCCATTGTC AAGATGAAGG AGCTTTGTGC CATGTATGGC 24 0 

AAGAAAGACC CCAATGAGCG GGACTCCTGG AGACCACTAC GTCGATAC 28 8 

(2) INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1027 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY linear 



iXi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 9 9 : 



GCTTTTTGGG AAAAACNCAA NTGGGGGAAA GGGGGNTTNN TNGCAAGGGG ATAAAGGGGG 60 

AANCCCAGGG TTTCCCCATT CAGGGAGGTG TAAAAAGNCG GCCAGGGGAT TGTAANAGGA 12 0 

TTCAATAATA GGGGG AA.TGG GCCCNGAAGT TGCAA.GGTTC CNGCCCGCCA TGNCCGCGGG 180 

ATTTAGTGAC ATTACGACGS TGGTAATAAA GTGGGSCCAA WAAA.TATTTG TGATGTGATT 24 0 

TTTSGACCAG TGAACCCATT GWACAGGACC TCATTTCCTY TGAGATGRTA GCCATAATCA 3 00 

GATAAAAGRT TAGAAGTYTT TCTGCACGTT AACAGCATCA TTAAATGGAG TGGCATCACC 3 60 

AATTTCACCC TTTGTTAGCC GATACCTTCC CCTTGAAGGC ATT(":AATTAA GTGACCAATC 4 20 

GTCATACGAG AGGGGATGGC ATGGGGATTG ATGATGATAT CAGGGGTGAT ACCTTCACAG 4 80 

GTGAAAGGCA TATCCTCTTG TCTATACTGA ATACCACAAG TACCCTTTTG ACCATGTCGA 54 0 

CTAGCAAATT TGTCTCCAAT CTGTGTWATC CCTAACAGAG CGTACCCTTA TTTTACAAAA 6 00 

TTTATATCCT TCCTGATTGA GAGTTACCAT AACCTGATCC ACAATGCCCG TCTCGCTWGT 660 

TCTGAGAAAA GTGCTACAGT CTCTCTTGGT ATAGCGTCTA TTGGTGCTCT CCAATTCATC 720 

TTCATTTTTC AGGCAAGGTG AACTGTTTTG CCTATAATAA CMTCATCTCC TGATACMCGA 7 80 

AACCCCKGGA RCTATCAAAC CATCATCATC CAGCGTTCKT WATGTYMCTA AATCCCTATT 840 

GCGGCCGCCT GCAGGTCAAC ATATNGGAAA ACCCCCCACC CCTTNGGAGC NTACCTTGAA 9 00 

TTTTCCATAT GTCCCNTAAA TTANCTNGNC TTANCCTGGC CNTAACCTNT TCCGGTTTAA 96 0 

ATTGTTTCCG CCCCCNTTCC CCNCCTTNNA ACCGGAAACC TTAATTTTNA ACCNGGGGTT 10 2 0 

CCTATCC 10 2 7 



(2) INFORMATION FOR SEQ ID NO: 2 00: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200: 

AGTGACATTA CGACGCTGGC CATCTTGAAT CCTAGGGCAT GAAGTTGCCC CAAAGTTCAG 6 0 

CACTTGGTTA AGCCTGATCC CTCTGGTTTA TCACAAAGAA TAGGATGGGA TAAAGAAAGT 12 0 

GGACACTTAA ATAAGCTATA AATTATATGG TCCTTGTCTA GCAGGAGACA ACTGCACAGG 18 0 
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TATACTACCA GCGTCGTAAT GTCACTA 2 07 

[2] INFORMATION FOR SEQ ID NO: 201: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 09 base pad.rs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201: 

TGGGCACCTT CAATATCTAT TAAAAGCACA AATACTGAAG AACACACCAA GACTATCAAT 60 

GAGGTTACAT CTGGAGTCCT CGATATATCA GGAAAAAATG AAGTGAACAT TCACAGAGTT 12 0 

TTACTTCTTT GGGAACTCAA ATGCTAGAAA AGAAAAGGGT GCCCTCTTTC TCTGGCTTCC 18 0 

TGGTCCTATC CAGCGTCGTA ATGTCACTA 209 

(2) INFORMATION FOR SEQ ID NO: 2 02: 

(i) SEQUENCE CHARACTERISTICS: 

.(A) LENGTH: 34 9. b^s^ ^pairs 

(B) TYP'E : riucleic, acid . . 

(C) STRANDEDNESS: .single 
' (D) TOPOLOGY: linear" 

(xi) SEQUENCE DfiSCRIPTlJONV SEQ ID NO; 2.02: 

NTACGCTGCA ACACTOTGGA GCCACTGGTT TTTATTCQCG GCAGGTTATC CAGCAAACAG 60 

TCACTGAACA CACCGAAGAC CGTGCTATGG. TAACCGTTCA CAGTAATCGT TCCAGTCGTC 12 0 

TGCGGGACCC CGACGAGCGT CACTGGCSTAC^ AGACCAGATT CAGCCGGAAG AGAAAGCGCC 180 

GCAGGGAGAG ACTCGAACTC CACTCCGCTG GTGAGCAGCC CCATGTtTTC AACTCGAAGT 24 0 

TCAAACGGCA TTGGGTTATA TACCATCAGC . TGAACTTCAC ACACATCTCC TTGAACCCAC 3 00 

TGGAAATCTA TTTTCTTGTT CCGCTCTTCT CCACAGTGTT GCAGCGTAA , 34 9 

(2) INFORMATION FOR SEQ ID NO: 2 03: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 241 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 03: 

TGCTCCTCTT GCCTTACCAA CCCAAAGCCC ACTGTGAAAT ATGAAGTGAA TGACAAAATT 60 

CAGTTTTCAA CGCAATATAG TATAGTTTAT CTGATTCTTT TGATCTCCAG GACACTTTAA 120 

ACAACTGCTA CCACCACCAC CAACCTAGGG ATTTAGGATT CTCCACAGAC CAGAAATTAT 180 

TTCTCCTTTG AGTTTCAGGC TCCTCTGGGA CTCCTGTTCA TCAATGGGTG GTAAATGGCT 24 0 

A _ 241 

(2) INFORMATION FOR SEQ ID N0:204: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 248 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) sequence description: seq id no: 204: 

tagccattta ccacccatct gcaaaccswg acmwwcargr cywgwackya ggcgatttga 60 

agtactggta atgctctgat catgttagtt acataagtgt ggtcagttta caaaaattca 12 0 

cagaactaaa tactcaatgc tatgtgttca tgtctgtgtt tatgtgtgtg taatgtttca 18 0 

attaagtttt tttaaaaaaa agagatgatt tccaaataag aaagccgtgt tggtaaggca 24 0 

agaggagc 24 8 

(2) Information for seq id no;205: 

(i) sequence characteristics: 

(A) LENGTH: 505 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



: {y.i) SEQUENCE DESCRIPTION: SEQ ID NO: 2 05: 

TACGCTGCAA CACTGTGGAG CCATTCATAC AGGTCCCTAA TTAAGGAACA AGTGATTATG 6 0 

CTACCTTTGC ACGGTTAGGG TACCGCGGCC GTTAAACATG TGTCACTGGG CAGGCGGTGC 12 0 

CTCTAATACT GGTGATGCTA GAGGTGATGT TTTTGGTAAA CAGGCGGGGT AAGATTTGCC 180 

GAGTTCCTTT TACTTTTTTT AACCTTTCCT TATGAGCATG CCTGTGTTGG GTTGACAGTG 24 0 

GGGGTAATAA TGACTTGTTG GTTGATTGTA GATATTGGGC TGTTAATTGT CAGTTCAGTG 3 00 

TTTTAATCTG ACGCAGGCTT ATGCGGAGGA GAATGTTTTC ATGTTACTTA TACTAACATT 36 0 

AGTTCTTCTA TAGGGTGATA GATTGGTCCA ATTGGGTGTG AGGAGTTCAG TTATATGTTT 42 0 

GGGATTTTTT AGGTAGTGGG TGTTGANCTT GAACGCTTTC TTAATTGGTG GCTGCTTTTA 4 80 

RGCCTACTAT GGGTGGTAAA TGGCT 50 5 

(2) INFORMATION FOR SEQ ID NO: 2 06: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 9 base pairs 

(B) TyPE: nucleic acid 

(C) STP-ANDEDNESS : single 
(d; TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: 

TAGACTGACT CATGTCCCCT ACCAAAGCCC ATGTAAGGAG CTGAGTTCTT AAAGACTGAA 60 
GACAGACTAT TCTCTGGAGA AAAATAAAAT GGAAATTGTA CTTTAAAAAA AAAAAAAATC 12 0 

GGCCGGGCAT GGTAGCACAC ACCTGTAATC CCAGCTACTA GGGGACATGA GTCAGTCTA 17 9 

(2) INFORMATION FOR SEQ ID NO: 2 07: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 176 base pairs 
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(B) TYPE: nucleic acid 
iC) STRANDEDNESS : Single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207: 

AGACTGACTC ATGTCCCCTA CCCCACCTTC TGCTGTGCTG CCGTGTTCCT AACAGGTCAC 60 

AGACTGGTAC TGGTCAGTGG CCTGGGGGTT GGGGACCTCT ATTATATGGG ATACAAATTT 12 0 

AGGAGTTGGA ATTGACACGA TTTAGTGACT GATGGGATAT GGGTGGTAAA TGGCTA 17 6 

(2) INFORMATION FOR SEQ ID NO:208: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 196 base pairs 

(B) TYPE: nucleic acid ' 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208: 

AGACTGACTC ATGTCCCCTA TTTAACAGGG TCTCTAGTGC TGTGAAAAAA AAAAATGCTG 6 0 

AACATTGCAT ATAACTTATA TTGTAAGAAA TACTGTACAA TGACTTTATT GC ATCTGGGT 12 0 

AGCTGTAAGG CATGAAGGAT GCCAAGAAGT TTAAGGAATA TGGGTCSGTAA ..ATGGCTAGGG 180 

GACATGAGTC AGTCTA 196 

(2) INFORMATION FOR SEQ ID- NO: 209: 

(i) SEQUENCE CHARACTERISTICS: - VV 

(A) LENGTH^ 345 base pairs ' ' . 

(B) TYPE: nucleic acid " . 

(C) STRANDEDNESS: single ' - 

(D) TOPOLOGY: linear " ' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209: 

GACGCTTGGC CACTTGACAC CTTTTATTTT TTAAGGATTC TTAAGTCATT . TANGTNACTT 6 0 

TGTAAGTTTT TCCTGTGCCC CCATAAGAAT GATAGCTTTA AAAATTATGC TGGGGTAGCA 12 0 

AAGAAGATAC TTCTAGCTTT AGAATGTGTA GGTATAGCCA GGATTCTTGT GAGGAGGGGT 18 0 

GATTTAGAGC AAATTTCTTA TTCTCCTTGC CTCATCTGTA ACATGGGGAT AATAATAGAA 24 0 

CTGGCTTGAC AAGGTTGGAA TTAGTATTAC ATGGTAAATA CATGTAAAAT GTTTAGAATG 3 00 

GTGCCAAGTA TCTAGGAAGT ACTTGGGCAT GGGTGGTAAA TGGCT 34 5 

(2) INFORMATION FOR SEQ IDTJO:210: 

(i) SEQUENCE CHARACTERISTICS:- 

(A) LENGTH^: 178 base pairs ■ 

(B) " TYPE': nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 
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GACGCTTGGC CACTTGACAC TAGAGTAGGG TTTGGCCAAC TTTTTCTATA AAGGACCAGA 6 0 

GAGTAAATAT TTCAGGCTTT GTGGGTTGTG CAGTCTCTCT TGCAACTACT CAGCTCTGCC 12 0 

ATTGTAGCAT AGAAATCAGC CATAGACAGG ACAGAAATGA ATGGGTGGTA AATGGCTA 17 8 

(2) INFORMATION FOR SEQ ID NO: 2 11: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(XI ) SEQUENCE DESCRIPTION: SEQ ID NO: 211: 

TGGGCACCTT CAA.TATCTAT CCAGCGCATC TAAATTCGCT TTTTTCTTGA TTAAAAATTT 6 0 

CACCACTTGC TGTTTTTGCT CATGTATACC AAGTAGCAGT GGTGTGAGGC CATGCTTGTT 12 0 

TTTTGATTCG ATATCAGCAC CGTATAAGAG CAGTGCTTTG GCCATTAATT TATCTTCATT 18 0 

GTAGACAGCA TAGTGTAGAG TGGTATCTCC ATACTCATCT GGAATATTTG GATCAGTGCC 24 0 

ATGTTCCAGC AACATTAACG CACATTCATC TTCCTGGCAT TGTACGGCCT TTGTCAGAGC 3 00 

TGTCCTCTTT TTGTTGTCAA GGACATTAAG TTGACATCGT CTGTCCAGCA CGAGTTTTAC 3 60 

TACTTCTGAA TTCCCATTGG CAGAGGCCAG ATGTAGAGCA GTCCTCTTTT GCTTGTCCCT 4 20 

CTTGTTCACA TCAGTGTCCC TGAGCATAAC GGAA 4 54 

(2) INFORMATION FOR SEQ ID NO: 2 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: 

TCCGTTATGC CACCCAGAAA ACCTA.CTGGA GTTACTTATT AACATCAAGG CTGGAACCTA 60 

TTTGCCTCAG TCCTATCTGA TTCATGAGCA CATGGTTATT ACTGATCGCA TTGAAAACAT 12 0 

TGATCACCTG GGTTTCTTTA TTTATCGACT GTGTCATGAC AAGGAAACTT AC/^AACTGCA 18 0 

ACGCAGAGAA ACTATTAAAG GTATTCAGAA ACGTGAAGCC AGCAATTGTT TCGCAATTCG 24 0 

GCATTTTGAA AACAAATTTG CCGTGGAAAC TTTAATTTGT TCTTGAACAG TCAAGAAAAA 3 00 

CATTATTGAG GAAAATTAAT ATCACAGCAT AACGGAA 33 7 

(2) INFORMATION FOR SEQ ID NO: 2 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 715 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{ D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:213: 
TCGGGTGATG CCTCCTCAGG CATCTTCCAT CCATCTCTTC AAGATTAGCT GTCCCAAATG 60 
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TTTTTCCTTC 


TCTTCTTTAC 


TGATAAATTT 


GGACTCCTTC 


TTGACACTGA 


TGACAGCTTT 


120 


AGTATCCTTC 


TTGTCACCTT 


GCAGACTTTA 


AACATAAAAA 


TACTCATTGG 


TTTTAAAAGG 


180 


AAAAAAGTAT 


ACATTAGCAC 


TATTAAGCTT 


GGCCTTGAAA 


CATTTTCTAT 


CTTTTATTAA 


240 


ATGTCGGTTA 


GCTGAACAGA 


ATTCATTTTA 


CAATGCAGAG 


TGAGAAAAGA 


AGGGAGCTAT 


300 


ATGCATTTGA 


GAATGCAAGC 


ATTGTCAAAT 


AAACATTTTA 


AATGCTTTCT 


TAAAGTGAGC 


360 


ACATACAGAA 


ATACATTAAG 


ATATTAGAAA 


GTGTTTTTGC 


TTGTGTACTA 


CTAATTAGGG 


42 0 


AAGCACCTTG 


TATAGTTCCT 


CTTCTAAAAT 


TGAAGTAGAT 


TTTAAAAACC 


CATGTAATTT 


480 


AATTGAGCTC 


TCAGTTCAGA 


TTTTAGGAGA 


ATTTTAACAG 


GGATTTGGTT 


TTGTCTAAAT 


540 


TTTGTCAATT 


TNTTTAGTTA 


ATCTGTATAA 


TTTTATAAAT 


GTCAAACTGT 


ATTTAGTCCG 


600 


X X X XV„/-lXVJ\^i 


1\ rT'y"' T\ TV "»\ 


AAATACCCAN 


GACAGGGTTA 


TTTATAAANG 


GAAAGANGTT 


660 


AATTTGACTC 


CCAGTTCACA 


GGCCTGAGGA 


NGNATCNCCC 


GAAATCCTTA 


TTGCG 


715 



(2) INFORMATION FOR SEQ ID NO: 2 14: 



(i) SEQUENCE CHARACTERISTICS: 
■ (A) LEIIGTH: 345 base pairs 

(B) TYPE: nucleic acid ' . " 

. (C) STRANDEDNESS : single . ' " 

(D) TOFOLOGY: linear' 

(xi) -SEQUENCE DESCRIPTION,:. S^Q ID N0:,214: "/^ 

GGTAANGNGC ATACNTCGGT GCTCCGGCCG CCGGAGTCGG GGGATTCGGG 'tGATGCCTCC 6 0 

TCAGGCCCAC TTGGGCCTGC TTTTCCCAAA TGGCAGCTCC TCTGGACATG CCATTCCTTC 12 0 

TCCCACCTGC CTGATTCTTC ATATGTTGGG TGTCCCTGTT TTTCTGGTGC TATTTCCTGA 18 0 

CTGCTGTTCA GCTGCCACTG TCCTGCAAAG CCTGCCTTTT TAAATGCCTC ACCATTCCTT 24 0 

CATTTGTTTC TTAA^.TATGG GAAGTGAAAG TGCCACCTGA GGCCGGGCAC AGTGGCTCAC 3 00 

GCCTGTAATC CCAGCACTTT GGGAGCCTGA GGAGGCATCA CCCGA 34 5 

(2) INFORMATION FOR SEQ ID NO:215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 2 9. base pairs 

(B) TYPE: nucleic acid 

. . , (C) STRANDEDNESS: single 

, (D) TOPOLOGY: linear ' ..~ . 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:215: 

GGTGATGCCT CCTCAGGCGA AGCTCAGGGA GGACAGAAAC CTCCCGTGGA GCAGAAGGGC 6 0 

AAAAGCTCGC TTGATCTTGA TTTTCAGTAC GAATACAGAC CGTGAAAGCG GGGCCTCACG 12 0 

ATCCTTCTGA CCTTTTGGGT TTTAAGCAGG AGGTGTCAGA AAAGTTACCA CAGGGATAAC 18 0 

TGGCTTGTGG CGGCCAAGCG TTCATAGCGA CGTCGCTTTT TGATCCTTCQ ATGTCGGCTC 24 0 

TTCCTATCAT TGTGAAGCAG AATTCACCAA GCGfTGOATt GTTCACCCAC TAATAGGGAA 3 00 

CGTGAGCTGG GTTTAGACCG TCGTGAGACA GGTTAGTTTT ACCCTACtGA . TGATGTGTKG 3 60 

TTGCCATGGT AATCCTGCTC AGTACGAGAG GAACCGCAGG, TTCA3ACATT TGGTGTATGT 42 0 

GCTTGCCTT 429 

(2) INFORMATION FOR SEQ ID NO: 2 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 593 base pairs 

(B) TYPE : nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 



TGACACCTAT GTCCNGCATC TGTTCACAGT TTCCACAA/vT AGCCAGCCTT TGGCCACCTC 6 0 

TCTGTCCTGA GGTATACAAG TATATCAGGA GGTGTATACC TTCTCTTCTC TTCCCCACCA 12 0 

AAGAGAACAT GCAGGCTCTG GAAGCTGTCT TAGGAGCCTT I^GGGCTCAGA ATTTCAGAGT 180 

CTTGGGTACC TTGGATGTGG TCTGGAAGGA GATvACATTGG CTCTGGATAA GGAGTACAGC 24 0 

CGGAGGAGGG TCACAGAGCC CTCAGCTCAA GCCCCTGTGC CTTAGTCT/iA AAGCAGCTTT 3 00 

GGATGAGG^^JV GCAGGTTAAG TAACATACGT AAGCGTACAC AGGTAGAAAG TGCTGGGAGT 3 60 

CAG/vATTGCA CAGTGTGTAG GAGTAGTACC TCAATCAATG AGGGCAAATC AACTGAAAGA 42 0 

AGAAGACCNA TTAATGAATT GCTTANGGGG AAGGATCAAG GCTATCATGG AGATCTTTCT 4 80 

AGGAAGATTA TTGTTTANAA TTATGAAAGG ANTAG(^.GCAG GGACAGGGCC AGAAGTANAA 54 0 

GANAACATTG CCTATANCCC TTGTCTTGCA CCCAGATGCT GGACAAGGTG TCA 5 93 



(2) INFORMATION FOR SEQ ID NO: 2 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 335 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 17: 



TGACACCTTG TCCAGCATCT GACGTGAAGA TGAGCAGCTC AGAGGAGGTG TCCTGGATTT 6 0 

CCTGGTTCTG TGGGCTCCGT GGCAATGAAT TCTTCTGTCA AGTGGATG;^J\ GACTACATCC 12 0 

AGGACAAJ^TT TAATCTTACT GGACTCAATG AGCAGOTCCC TCACTATCGA CAAGCTCTAG 18 0 

ACATGATCTT GG ACCTGGAG CCTGATGAAG AACTGGAAGA CAACCCCAAC CAGAGTGACC 24 0 

TGATTGAGCA GGCAGCCGAG ATGCTTTATG GATTGATCCA CGCCCGCTAC ATCCTTACCA 3 00 

ACCGTGGCAT CGCCCAGATG CTGGACA?.GG TGTCA 33 5 



(2) INFORMATION FOR SEQ ID NO: 21 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 248 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 18: 



TACGTACTGG TCTTGAAGGT CTTAGGTAGA GAAAAAATGT GAATATTTAA TCAAAGACTA 6 0 

TGTATGAAAT GGGACTGTAA GTACAGAGGG AAGGGTGGCC CTTATCGCCA GAAGTTGGTA 12 0 

GATGCGTCCC CGTCATGAAA TGTTGTGTCA CTGCCCGACA TTTGCCGAAT TACTGAAATT 18 0 

CCGTAGAATT AGTGCAAATT CTAACGTTGT TCATCTAAGA TTATGGTTCC ATGTTTCTAG 24 0 

TACTTTTA 24 8 



(2) INFORMATION FOR SEQ ID NO: 2 19: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 530 base pairs 

(B) TYPE: nucleic acid 
iC) STRMDEDNESS: single 
(D) TOPOLOGY; linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219: 



TGACGCTTGG CCACTTGACA CAAGTAGGGG ATAAGGACAA AGACCCATNA GGTGGCCTGT 60 

CAGCCTTTTG TTACTGTTGC ' TTCCCTGTCA CCACGGCCCC CTCTGTAGGG GTGTGCTGTG 12 0 

CTCTGTGGAC ATTGGTGCAT TTTCACACAT ACCATTCTCT TTCTGCTTCA CAGCAGTCCT 180 

GAGGCGGGAG CACACAGGAC TACCTTGTCA GATGANGATA ATGATGTCTG GCCAACTCAC 24 0 

CCCCCAACCT TCTCACTAGT TAXAJJGAAGA GCCANGCCTA NAACCTTCTA TCCTGNCCCC 3 00 

TTGCCCTATG ACCTCATCCC TGTTCCATGC CCTATTCTGA TTTCTGGTGA ACTTTGGAGC 360 

AGCCTGGTTT NTCCTCCTCA CTCCAGCOTC TCTCCATACC ATGGTANGGG GGTGCTGTTC 42 0 

CACNCAAANG GTCAGGTGTG TCTGGGGAAT CCTNAJJA^ICT GCCNGGAGTT TCCNAIJGCAT 4 80 

TCTTAAJ^AAC CTTCTTGCCT AATCANATNG TGTCCAGTGG CC.AACCNTCN 53 0 



(2) INFORMATION FOR SEQ ID NO: 22 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 531 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 



TGACGCTTGG CCACTTGACA CTAAATAGCA. TCTTCTAAAG GCCTGATTCA GAGTTGTGGA 6 0 

AAATTCTCCC AGTGTCAGGG ATTGTCAGGA ACAGGGCTGC TCCTGTGCTC ACTTTACCTG 12 0 

CTGTGTTTCT GCTGGAAAAG GAGGGAAGAG GAATGGCTGA TTTTTACCTA ATGTCTCCCA 180 

GTTTTTCATA TTCTTCTTGG ATCCTCTTCT CTGACAACTG TTCCCTTTTG GTCTTCTTCT 24 0 

TCTTGCTCAG AGAGCAGGTC TCTTTAAAAC TGAGAAGGGA GAATGAGCAA ATGATTAAAG 3 00 

AAAACACACT TCTGAGGCCC AGAGATCAAA TATTAGGTAA ATACTAAACC GCTTGCCTGC 3 60 

TGTGGTCACT TTTCTCCTCT TTCACATGCT CTATCCCTCT ATCCCCCACC TATTCATATG 42 0 

GCTTTTATCT GCCAAGTTAT CCGGCCTCTC ATCAACCTTC TCCCCTAGCC TACTGGGGGA 4 80 

TATCCATCTG GGTCTGTCTC TGGTGTATTG GTGTCAAGTG GCCAAGCGTC A 531 



(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 530 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:221: 



ATTGACGCTT GGCCACTTGA CACCCGCCTG CCTGCAATAC TGGGGCAAGG GCCTTCACTG 60 

CTTTCCTGCC ACCAGCTGCC ACTGCACACA GAGATCAGAA ATGCTACCAA CCAAGACTGT 12 0 

TGGTCCTCAG CCTCTCTGAG GAGAAAGAGC AGAAGCCTGG AAGTCAGAAG AGAAGCTAGA 18 0 

TCGGCTACGG CCTTGGCAGC CAGCTTCCCC ACCTGTGGCA ATAAAGTCGT GCATGGCTTA 24 0 

ACAATGGGGG CACCTCCTGA GAAACACATT GTTAGGCAAT TCGGCG_TGTG TTCATCAGAG 300 
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CATATTTACA CAAACCTCGA TAGTGCAGCC TACTATCCAC TATTGCTCCT ACGCTGCAAA 3 60 

CCTGAACAGC ATGGGACTGT ACTGAATACT GGAAGCAGCT GGTGATGGTA CTTATTTGTG 4 20 

TATCTAAACA CAGAGAAGGT ACAGTAAGAi^. TATGGTATCA TAAACTTAGA GGGACCGCCA 4 80 

TCCTATATGC AGTCTGTTGT GAGCAAAATG TGTCAAGTGG CCAAGCGTCA 53 0 



(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 578 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 2: 



TGTATCGACG TAGTGGTCTC CGGGCTACTA GGCCGTTGTG TGCTGGTAGT ACCTGGTTCA 6 0 

CTGAAAGGCG CATCTCCCTC CCCGCGTCGC CCTGAAGCAG GGGGAGGACT TCGCCCAGCC 12 0 

AAGGCAGTTG TATGAGTTTT AGCTGCGGCA CTTCCAGACC TCTGAGCCCA CCTCCTTCAG 180 

GAGCCTTCCC CGATTAj^^^GGA AGCCAGGGTA AGGATTCCTT CCTCCCCCAG ACACCACGAA 24 0 

CAAACCACCA CCCCCCCTAT TCTGGCAGCC CATATACATC AGAACGAAAC AAAAATAi^.CA 3 00 

AATAAACNAA AACC.sJUIlAAA AAAAGAGAAG GGGAAATGTA TATGTCTGTC CATCCTGTTG 360 

CTTTAGCCTG TCACCTCCTA NAGGGCAGGC ACCGTGTCTT CCGAATGGTC TGTGCAGCGC 42 0 

CGACTGCGGG Ai^^GTATCCGA GCAGGAAGCA GAGTCAGCAG AAGTTGAACG CTGGGCCCGG 4 80 

CGGCTCTTGG GGGCTGGTGT TGTACTTCGA GACCGCTTTC GCTTTTTGTC TTAGATTTAC 54 0 

GTTTGCTCTT TGGAGTGGGA NACCACTACN TCNATACA 57 8 



(2) INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 578 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NC:223: 



TGTATCGACG TAGTGGTCTC CTCTTGCAAA GGACTGGCTG GTGAATGGTT TCCCTGAATT 6 0 

ATGGACTTAC CCTAAACATA TCTTATCATC ATTACCAGTT GCAAAATATT AGAATGTGTT 12 0 

GTCACTGTTT CATTTGATTC CTAGAAGGTT AGTCTTAGAT ATGTTACTTT AACCTGTATG 18 0 

CTGTAGTGCT TTGAATGCAT TTTTTGTTTG CATTTTTGTT TGCCCAACCT GTCAATTATA 24 0 

GCTGCTTAGG TCTGGACTGT CCTGGATAAA GCTGTTAAAA TATTCACCAG TCCAGCCATC 3 00 

TTACAAGCTA ATTAAGTCAA CTAAATGCTT CCTTGTTTTG CCAGACTTGT TATGTCAATC 3 60 

CTCAATTTCT GGGTTCATTT TGGGTGCCCT AAATCTTAGG GTGTGACTTT CTTAGCATCC 42 0 

TGTAACATCC ATTCCCAAGC AAGCACAACT TCACATAATA CTTTCCAGAA GTTCATTGCT 4 80 

GAAGCCTTTC CTTCACCCAG CGGAGCAACT TGATTTTCTA CAACTTCCCT CATCAGAGCC 54 0 

ACAAGAGTAT GGGATATGGA GACCACTACG TCGATACA 57 8 



(2) INFORMATION FOR SEQ ID NO: 2 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : • linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 

TGTATCGACG TANTGGTCTC CCAAGGTGCT GGGATTGCAG GCATGAGCCA CCACTCCCAG 6 0 

GTGGATCTTT TTCTTTATAC TTACTTCATT AGGTTTCTGT TATTCAAGAA GTGTAGTGGT 120 

AAAAGTCTTT TCAATCTACA TGGTTAAATA ATGATAGCCT GGGAAATAAA TAGAAATTTT 180 

TTCTTTCATC TTTAGGTTGA ATAAAGAAAC AGAAAAAATA GAACATACTG AAAATAATCT 24 0 

AAGTTCCAAC CATAGAAGAA CTGCAGAAGA AATGAAGAAA GTGATGATGA TTTAGATTTT 3 00 

GATATTGATT TAGAAGACAC AGGAGGAGAC CACTACGTCG ATACA 345 

(2) INFORMATION FOR SEQ ID NO: 22 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 347 base pairs 

(B) TYPE: nucleic acid 

(C) STPJVNDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUlENCE DESCRIPTION: SEQ ID NO: 225: • 

TGTATCGACG TAGTGGTCTC CAAACTGAGG TATGTGTGCC ACTAGCACAC AAAGCCTTCC 6 0 

AAGAGGGACG CAGGCACAGG CAGTTTAAAG GQAATCTGTT TCTAAATTAJ^. TTTCCACCTT 12 0 

CTCTAAGTAT TCTTTCCTAA AACTGAT-CAA GGTGTGAAGC cfCTGCTCTT TCCCAACTCC 18 0 

CCTTTGACAA CAGCCtTCAA CTAACACAAG AAAAGGCATG TCTGACACTC TTCCTGAGTC 24 0 

TGACTCTGAT ACGTfG'fTCT GATGTCTAAA GAGCTCCAGA AGACCAAAGG GACAATTCAG 3 00 

AATGCTGGTG TATAACAGAC TCCAATGGAG ACCACTACGT CGATACA 34 7 

(2) INFORMATION FOR SEQ ID NO: 22 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226: 

AGGNGNGGGA NTGTATCGAC GTAGTGGTCT CCCAACAGTC TGTCATTCAG TCTGCAGGTG 6 0 

TCAGTGTTTT GGACAATGAG GCACCATTGT CACTTATTGA CTCCTCAGCT CTAAATGCTG 12 0 

AAATTAAATC TTGTCATGAC AAGTCTGGAA TTCCTGATGA GGTTTTACAA AGTATTTTGG 180 

ATCAATACTC CAACAAATCA GAAAGCCAGA AAGAGGATCC TTTCAATATT GCAGAACGAC 24 0 

GAGTGGATTT ACACACCTCA GGAGACCACT ACGTCGATAC A 2 81 

(2) INFORMATION FOR SEQ ID NO: 22 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3646 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 27: 

GGGAAACACT TCCTCCCAGC CTTGTAAGGG TTGGAGCCCT CTCCAGTATA TGCTGCAGAA 6 0 

TTTTTCTCTC GGTTTCTCAG AGGATTATGG AGTCCGCCTT AAAAAAGGCA AGCTCTGGAC 12 0 

ACTCTGCAAA GTAGAATGGC CAAAGTTTGG AGTTGAGTGG CCCCTTGAAG GGTCACTGAA 18 0 
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CCTCACAATT GTTCAAGCTG TGTGGCGGGT TGTTACTGAA AGTGGCGGCG TCCCTGATGA 24 0 

GTTTCCCTAC ATTGATGAAT GCCTGAGTTT GGTCAGGAGC ACCCCTTGGG TGGCTCCACT 3 30 

CATGCACCAT TCATAATTTT ACCTCCAAGG TCCTCCTGAG CCAGACCGTG TTTTCGCCTC 3^5 0 

GACCCTCAGC CGGTTCGGCT CGCCCTGTAC TGCCTCTCTC TGAAGAAGAG GAGAGTCTCC 42 0 

CTCACCCAGT CCCACCGCCT TAAAACCAGC CTACTCCCTT AGGGTCATCC CATGTCTCCT 4d0 

CGGCTATGTC CCCTGTAGGC TCATCACCCA TTGCCTCTTG GTTGCAACCG TGGTGGGAGG 54 0 

AAGTAGCCCC TCTACTACCA CTGAGAGAGG CACAAGTCCC TCTGGGTGAT GAGTGCTCCA 6 00 

CCCCCTTCCT GGTTTATGTC CCTTCTTTCT ACTTCTGACT TGTATAATTG GAAAACCCAT 66 0 

AATCCTCCCT TCTCTGAAAA GCCCCAGGCT TTGACCTCAC TGATGGAGTC TGTACTCTGG 72 0 

ACACATTGGC CCACCTGGGA TGACTGTCAA CAGCTCCTTT TGACCCTTTT CACCTCTGAA 78 0 

GAGAGGGAAA GTATCCAAAG AGAGGCCAAA AA3TACAACC TCACATCAAC CAATAGGCCG 84 0 

GAGGAGGAAG CTAGAGGAAT AGTGATTAGA GACCCAATTG GG ACCTAATT GGGACCCAAA 90 0 

TTTCTCAAGT GGAGGGAGAA CTTTTGACGA TTTCCACCGG TATCTCCTCG TGGGTATTCA 96 0 

GGGAGCTGCT CAGAAACCTA TAAACTTGTC TAAGGCGACT GAAGTCGTCC AGGGGCATGA 102 0 

TGAGTCACCA GCAGTGTTTT TAGAGCACCT CCAGGAGGCT TATCAGATTT ACACCCCTTT 10 8 0 

TGACCTGGCA GCrCCCGAAA ATAGCCATGC TCTTAATTTG GCATTTGTGG CTCAGGCAGC 114 0 

CCCAGATAGT AA.^^J^vGGAAAC TCCAAAAACT AGAGGGATTT TGCTGGAATG AATACCAGTC 12 0 0 

AGCTTTTAGA GA.TAGCCTAA. AAGGTTTTTG ACAGTCAAGA GGTTGAAAAA CAAAAACAAG 12 6 0 

CAGCTCAGGC AGCTGAAAAA AGCCACTGAT AAAGCATCCT GGAGTATCAG AGTTTACTGT 13 2 0 

TAGATCAGCC TCATTTGACT TCCCCTCCCA CATGGTGTTT AAATCCAGCT ACACTACTTC 13 8 0 

CTGACTCAAA C7CCAXTATT CCTGTTCATG ACTGTCAGGA ACTGTTGGAA ACTACTGAAA 144 0 

CTGGCCGACC TGATCTTCAA AATGTGCCCC TAGGAAAGGT GGATGCCACC ATGTTCACAG 15 00 

ACAGTAGCAG CTTCCTCGAG AAGGGACTAC GAAAGGCCGG TGCAGCTGTT ACCATGGAGA 15 6 0 

CAGATGTGTT GTGGGCTCAG GCTTTACCAG CAAACACCTC AGCACAAAAG GCTGAATTGA 162 0 

TCGCCCTCAC TCAGGCTCl^C CGATGGGGTA AGGATA.TT/^ CGTTAACACT GACAGCAGGT 16 8 0 

ACGCCTTTGC T^-ITCTGCAT GTACGTGGAG CCATCTACCA GGAGCGTGGG CTACTCACCT 174 0 

CAGCAGGTGG CTGTAATCCA CTGTAAAGGA CATCAAAAGG AAAACACGGC TGTTGCCCGT 18 00 

GGTAACCAGA A^.GCTGATTC AGCAGCTCAA GATGCAGTGT GACTTTCAGT CACGCCTCTA 186 0 

AACTTGCTGC CCACAGTCTC CTTTCCACAG CCAGATCTGC CTGACAATCC CGCATACTCA 192 0 

ACAGAAGAAG A;--AACTGGCC TCAGAACTCA GAGCC7VATAA AAATCAGGAA GGTTGGTGGA 19 80 

TTCTTCCTGA C7CTAGAATC TTCATACCCC GAACTCTTGG GAAAACTTTA ATCAGTCACC 2 04 0 

TACAGTCTAC CA7CCATTTA GGAGGAGCAA AGCTACCTCA GCTCCTCCGG AGCCGTTTTA 2100 

AGATCCCCCA T7TTCAAJ\GC CTAACAGATC MlGCAGCTCT CCGGTGCACA ACCTGCGCCC 2 160 

AGGTAAATGC CA-AAAAAGGT CCTAAACCCA GCCCAGGCCA CCGTCTCCAA GAAAACTCAC 22 20 

CAGGAGAAAA GTGGGAAATT GACTTTACAG AAGTAAAACC ACACCGGGCT GGGTACAAAT 22 8 0 

ACCTTCTAGT A2TGGTAGAC ACCTTCTCTG GATGGACTGA AGCATTTGCT ACCAAAAACG 2 34 0 

AAACTGTCAA TATGGTAGTT AAGTTTTTAC TCAATGAAAT CATCCCTCGA CATGGGCTGC 24 00 

CTGTTTGCCA TAGGGTCTGA TAATGGACCG GCCTTCGCCT TGTCTATAGT TTAGTCAGTC 2 4 60 

AGTAAGGCGT T;^J^J\CATTCA ATGGAAGCTC CATTGTGCCT ATCGACCCCA GAGGTCTGGG 25 2 0 

CAAGTAGAAC GCATGAAGTG CACCCTAAAA AACACTCTTA CAAAATTAAT CTTAGAAACC 25 80 

GGTGTAAATT GTGTAAGTCT CCTTCCTTTA GCCCTACTTA GAGTAAGGTG CACCCCTTAC 2 64 0 

TGGGCTGGGT TCTTACCTTT TGAAATCATG TATGGGAGGG TGCTGCCTAT CTTGCCTAAG 2 7 00 

CTAAGAGATG CCTAATTGGC A.AAA?^.TATCA CAAACTAATT TATTACAGTA CCTACAGTCT 27 60 

CCCCAACAGG TACAAGATAT CATCCTGCCA CTTGTTCGAG GAACCCATCC CAATCCAATT 2 82 0 

CCTGAACAGA CAGGGCCCTG CCATTCATTC CCGCCAGGTG ACCTGTTGTT TGTTAAAAAG 2 8 80 

TTCCAGAGAG AA.GGACTCCC TCCTGCTTGG AAGAGACCTC ACACCGTCAT CACGATGCCA 2 94 0 

ACGGCTCTGA AGGTGGATGG CATTCCTGCG TGGATTCATC ACTCCCGCAT CAAAAAGGCC 3 0 00 

AACAGAGCCC AA2TAGAAAC ATGGGTCCCC AGGGCTGGGT CAGGCCCCTT AAAACTGCAC 3 0 60 

CTAAGTTGGG TGAAGCCATT AGATTAATTC TTTTTCTTAA TTTTGTAAAA CAATGCATAG 3120 

CTTCTGTCAA ACTTATGTAT CTTAAGACTC AATATAACCC CCTTGTTATA ACTGAGGAAT 3180 

CAATGATTTG ATTCCCCCAA AAACACAAGT GGGGAATGTA GTGTCCAACC TGGTTTTTAC 3 24 0 

TAACCCTGTT TTTAGACTCT CCCTTTCCTT TAATCACTCA GCTTGTTTCC ACCTGAATTG 3 3 00 

ACTCTCCCTT AGCTAAGAGC GCCAGATGGA CTCCATCTTG GCTCTTTCAC TGGCAGCCGC 3 3 60 

TTCCTCAAGG ACTTAACTTG TGCAAGCTGA CTCCCAGCAC ATCCAAGAAT GCAATTAACT 34 20 

GATAAGATAC TGTGGCAAGC TATATCCGCA GTTCCCAGGA ATTCGTCCAA TTGATCACAG 34 80 
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CCCCTCTACC CTTCAGCAAC CACCACCCTG ATCAGTCAGC AGCCATCAGC AGCGAGGCAA 3 54 0 

GGCCCTCCAC CAGCAAAAAG ATTCTGACTC ACTGAAGACT TGGATGATCA TTAGTATTTT 3600 
TAGCAGTAAA GTTTTTTTTT CTTTTTGTTT CTTTTTTTCT CGTGCC 3 64 6 

(2) INFORMATION FOR SEQ ID NO: 22 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 19 base pairs ■ 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single. 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA' . 

(vi) ORIGINAL SOURCE: " 
(A) ORGANISM: Homo sapiens 

(:<i) SEQUENCE DESCRIPTION: SEO ID NO:.228: 



-TAAGAGGGTA CAAGATCTAA. GCACAGCCGT CAATGCAGAA CACAGAACGT AGCCTGGTAA 6 0 

GTGTGTTAA.G AGTGGGAATT TTTGGAGTAC. AQAGTAAGGC ACCTAACCCT AGCTGGGGTT 12 0 

TGGTGACCGT CCCAGATGGC TTACAGAAGA'. AAGTGTCCTG AGATGAC^TTT TTAAGAATGA 180 

ATAAGGATAG ACACAAGTGA GGACTGACTT GGCAGTGGTG AATGGTGGGT GGCAAAAAAC 24 0 

TTCGCATGTA TGGAAJ^CTGC ACGTACAGGA ATGAAGAATG AGACTGTGTG GTGTTTAATG 3 00 

AGCTGGAAAT ACTAA.TTTTA TCCTGAAAGT TTTGAAGAGT TAACTAAAAA GTAtTTTTTA 3 60 

GTAAGGAAAT AACCCTACAT TTCAQGGTTA , TTGTTTGTTT Al^TATATTGAA GGTGCCCAA 419 



(2) INPPRMATION FOR SEQ ID NO: 22 9:. . 

■ : (i) SEQUENCE CHARACTERISTICS: 
: • (A) LENGTH: 14 8 base paixs . 

(B) TYPE: nucleic acid. . ' - 

, (C) STRANDEDNESS : srngle 

(D) TOPOLOGY: - linear - '. • ' ' 

(li) MOLECULE TYPE: cDNA , : " - . 

^(.vi) ORIGINAL SOURCE: . 

(A) ORGANISM: Ho-rao sapiens . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:229: 

AAGAGGGTAC CTGTATGTAG CCATGGTGGC AATGAGAGAC TGATTACTAC CTGCTGGAGA 6 0 

TTGTTTAAGT GAGTTAATAT ATTAAGGATA AAGGGAGCCA GGTTTTTTGA CTGTTGGAGA 12 0 

AGGAAATTAC AGATATTGAA GGTCCCAA . 14 8 

(2) INFORMATION FOR SEQ ipNO:230: . . y 

(i). SEQUENCE CHARACTERISTICS: ' , . 
- - (A) LE^JGTH: 257 base, pairs. 

(B) TYPE: nucleic acid 

. ' • (Cj - STRANDEDNESS single 

(D) TOPOLOGY: linear. r . 

(ii) MOLECULE TYPE: cDNA 
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(VI) ORIGINAL SOURCE: 

(A) ORGANISM: Hcmo Sapiens 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 0: 



TAAGAGGGTA Cr«4AAAAAAAA AAAATAGAAC GAATGAGTAA GACCTACTAT TTGATAGTAC 

AACAGGGTGA CTATAGTCAA TGATAACTTA ATTATACATT TAACATAGAG TGTAATTGGA 

TTGTTTGTAA CTCGAAGGAT AAATGCTTGA GAGGATGGAT ACCCCATTCT CCATGATGTA 

CTTATTTCAC ATTACATGCC TGTATCAAAG CATCTCATAT ACCCTATAAA TATGTACACC 

TACTATGTAC CCTCTTA 



60 
120 
180 
240 
257 



(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 260 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 3 1 : 

TAAGAGGGTA CGGGTATTTG CTGATGGGAT TTTTTTTTCT TTCTTTTTCT TTGGAAAACA 6 0 

AAATGAAAGC CAGAACAAAA TTATTGAACA AAAGACAGGG ACTAAATCTG GAGAAATGAA 12 0 

GTCCCCTCAC CTGACTGCCA TTTCATTCTA TCTGACCTTC CAGTCTAGGT TAGGAGAATA 18 0 

GGGGGTGGAG GGGATTAATC TGATACAGGT ATATTTAAAG CAACTCTGCA TGTGTGCCAG 24 0 

AAGTCCATGG TACCCTCTTA 26 0 

(2) INFORMATION FOR SEQ ID NO: 2 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 596 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO -232: 

TGCTCCTCTT GCCTTACCAA CCACAAATTA GAACCATAAT GAGATGTCAC CTCATACCTG 6 0 

GTGGGATTAA CATTATTTAA AAAATCAGAA GTATTGACAA GGATGTGAAG AAATTAGAAC 12 0 

ATCTGTGCAC TGTTGGTGGG AATGTAAAAA AGGTGTGGCC ACTATGGGTA ACAGCATGAA 18 0 

GGTTCCTCAA AAAAAATTTT TTTTAATCTA CTCTATGATC GATCTTGAGG TTGTTTATGC 24 0 

AAAAGAACTG AAATCAGGAT TTTGAGGAAA TATTCACATT CCCACATCCA TTTCTGCTTT 3 00 

ATTCATAATA CTCAAGAGAT GGAAACAACC TAAATGTCCA TCCCGGGATG AATGGATAAA 36 0 

CACAGTGTGG TATATGCATA CAATGGAATA TTATTTAGTC TTTAAAAAGA AAAATTCTAT 42 0 
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CATATACTAC AACTTANATN AACCTTGAGG ACACAATGCT NAGTGAAATA AGCCACGGAA 4 80 

GGACGAATAC TGCATTATTC CCTTATATGA AGTATCTAAA GTGGTCAAAC TCTTANAGCA 54 0 

NAAAGTAAAA ATGGGTGGTT GCCANACAGT TGGTTAGGCN AGAAGANAAN CCTANT 5 96 

(2) INFORMATION FOR SEQ ID NO: 233: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 base pairs 

(B) TYPE: nucleic acid 

(C) STRAI'JDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

{vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO:233: 

TCTTCTGAAG ACCTTTCGCG ACTCTTAAGC TCGTGGTTGG TAAGGCAAGA , GGAGCGTTGG 6 0 

TAAGGCAAGA GGAGCGTTGG TAAGGCAAGA GGAGCA 96 

(2) INFORMATION FOR SEQ ID NO:234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 313 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:234: 

TGTAAGTCGA GCAGTGTGAT GATAAAACTT GAATGGATCA ATAGTTGCTT CTTATGGATG 6 0 

AGCAAAGAAA GTAGTTTCTT GTGATGGAAT CTGCTCCTGG CAAAAATGCT GTGAACGTTG 12 0 

TTGAAAAGAC AACAAAGAGT TTAGAGTAGT ACATAAATTT AGAATAGTAC ATAAACTTAG 180 

AATAGTACAT AAACTTAGTA CATAAATAAT GCACGAAGCA GGGGCAGGGC TTGAGAGAAT 24 0 

TGACTTCAAT TTGGAAAGAG TATCTACTGT AGGTTAGATG CTCTCAAACA GCATCACACT 3 00 

GCTCGACTTA CAA 3 13 

(2) INFORMATION FOR SEQ ID NO: 23 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 550 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D_) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 1 

(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Homo sapiens 

(xi) SEOUEHCE DESCRIPTION: SEQ ID NO : 2 3 5 : 

AACGAGGACA GATCCTTAAA AAGAATGTTG AGTGAAAAAA GTAGAAAATA AGATAATCTC 6 0 

CAAAGTCCAG TAGCATTATT TAAACATTTT TAAAAAATAC ACTGATAAAA ATTTTGTACA 12 0 

TTTCCCAAAA ATACATATGG AAGCACAGCA GCATGAATGC CTATGGGRTT GAGGATAGGG 18 0 

GTTGGGAGTA GGGATGGGGA TAAAGGGGGA AA^^^TAP^^CC AGAGAGGAGT CTTACACATT 24 0 

TCATGAACCA AGGAGTATAA TTATTTCAAC TATTTGTACC VVGAAGTCCAG AAAGAGTGGA 300 

GGCAGAAGGG GGAGAAGAGG GCGAAGAAAC GTTTTTGGGA GAGGGGTCCC ASAAGAGAGA 3 60 

TTTTCGCGAT GTGGCGCTAC ATACGTTTTT CCAGGATGCC TTAAGCTCTG CACCCTATTT 42 0 

TTCTCATCAC TAJ^.TATTAGA TTAAACCCTT TGAAGACAGC GTCTGTGGTT TCTCTACTTC 4 80 

AGCTTTCCCT CCGTGTCTTG CACACAGTAG CTGTTTTACA AGGGTTGAAC TGACTGAAGT 54 0 

GAGATTATTC 5 5 0 

(2) TNFORMATIOM FOR SEQ ID NO: 2 36: 

(i) SE0UE:ICE CHARACTERISTICS: 

(A) LENGTH: 325 base pairs 
(E) TYPE: nucleic acid 

(C) STRAlsIDEDNESS • single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGxNAJ:, SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 

TAGACTGACT CATGTCCCCT ACCAGAGTAG CTAGAATTAA TAGCACAAGC CTCTACACCC 6 0 

AGGAACTCAC TATTGAATAC ATAAATGGAA TTTATTCAGC CTTAAAAAGT TTGGAAGGAA 12 0 

ATTCTGACAT ATGCTAAAAC ATGGATGAAC CTTGAAGACT TTATGATAAG TAAJVAGAAGC 180 

CAGTCATAAA AGG^-^JW^TA TTGCATGATT CCACTTATAT GAGGTACCTA GAGTAGTCAA 24 0 

TTTCATAGAA ACAC.:^AAATA GAATGGTGTT TGCCAGGGCT TTTGAGGAA/\ AGGGAATGAC 3 00 

AAGTTAGGGG ACATGAGTCA GTCTA 3 25 

(2) INFORMATION FOR SEQ ID NO: 237: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 373 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 7: 

TAGACTGACT CATGTCCCCT ATCTACTCAA CATTTCCACT TGAAGTCTGA TAGGCATCTC 60 
AGACTTATCT TGTCCCAAAG CAAACTCTTT ATTTCTTTTC ATCCTAGTCT TTATTTCTTG 12 0 

TGCTGTCTTA CCCATCTCAA AAGAGTGCCA AAATCCACCA AGTTGCTGAA ACAGAAATCT 180 
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AAGAAATATC CTTGATTCTT CTTTTTCCCA TCTACTTCAC TTCTAATTCA TTAGTAAATA 24 0 

ATCTGTTTCA GAAAACCAAA CACCTCATGT TCTCACTCAT AAGGGGGAGT TGAACAATGA 3 00 

GAACACACAG ACACAGGGAG GGGAACATCA CACACCACGG CCCGTCAGGG AGTATJGGGAC 36 0 

ATGAGTCAGT CTA 3 73 



(2) INFORMATION FOR SEQ ID NO : 2 3 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 492 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ' ID NO:238: 



TAGACTGACT CATGTCCCCT ATAATGCTCC" CAGGCATCAG AAAGCATCTC AAACTGGAGC 6 0 

TGACACCATG GCAGAGGTTT CAGGTAAGTC ACAA.AAGGGG TCCTAAAGAA TTTGCCCTCA 12 0 

ATATCAGAGT GATTAGAAGA AGTGGACAGA GCTACCCAAG TTAAACATAT GCGAGATAAA 180 

AAAAATATGG CACTTGTGAA CACACACTAC AGGAGGAAAA TAAGGAACAT AATAGCATAT 24 0 

TGTGCTATTA TGATGATGAA GAACCTCTCT ANAAGAAAAC ATAACCAAAG AAACAAAGAA 3 00 

AATTCCTGCN AATGTTTAAT GCTATAGAAG AAATTAACAA AAACATATAT TCAATGAATT 3 60 

CAGAAA^GTT AGCAGGTCAN AAGAAAACAA ATCAAAGACC AGAATAATCC CATTTTAGAT 420 

TGTCGAGTAA ACTANAACAG AAAGAATACC ACTGGAAATT GAATTCCTAC GTANGGGACA 4 80 
TGANTCANTC TA - - • < ' 



(2) INFORMATION .FOR SEQ ID NO:239: 

ii) SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 482 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:239: 



TGGAAA.GTAT TTAATGATGG GCAACTTGCT GTTTACTTCC TACATATCCC ATCATCTTCT 60 

GTATTTTTTT AAATAACTTT TTTTTGGATT TTTAAAGTAA CCTTATTCTG AGAGGTAACA 12 0 

TGGATTACAT ACTTCTAAGC CATTAGGAGA CTCTATGTTA AACCAAAAGG AAATGTTACT 180 

AGATCTTCAT TTGATCAATA GGATGTGATA ATCATCATCT TTCTGCTCTA ATGGAAAAGT 240 

ACTANAAACA TGGAACCATA ATCTTAGATG AACAACGTTA GAATTTGCAC TAATTCTACG 3 00 

GAATTTCAGT AATTCGGCAA ATGTCGGGCA GTGACACAAC ATTTCATGAC GGGGACGCAT 3 60 

CTACCAACTT CTGGCGATAA GGGCCACCCT TCCCTCTGTA CTTACAGTCC CATTTCATAC 42 0 

ACAGTCTTTG ATTAAATATT CACATTTTTT CTCTACCTAA AGACCTTCAA GACCAGTACG 4 80 

TA . . ' - 4 82 
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(2} INFORMATION FOR SEQ ID NO: 240: 

li) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 519 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240: 

TGTATCGACG TAGTGGTCTC CCCATGTGAT AGTCTGAiW ATAGCCTCAT GGGATGAGAG 

GCTGTGCCCC AGCCCGACAC CCGTAAAGGG TCTGTGCTGA GGTGGATTAG TAAAAGAGGA 

AAGCCTTGCA GTTGAGATAG AGGAAGGGCA CTGTCTCCTG CCTGCCCCTG GGAACTGAAT 

GTCTCGGTAT AAAACCCGAT TGTACATTTG TTC;iATTCT(^, AGATAGGAGA .^AAACCACCC 

TATGGCGGGA GGCGAGACAT GTTGGCAGCA ATGCTGCCTT GTTATGCTTT ACTCCACAGA 

TGTTTGGGCG GAGGGAAACA TAAATCTGGC CTACGTGCAC ATCCAGGCAT AGTACCTCCC 

TTTGAACTTA ATTATGACAC AGATTCCTTT GCTCACATGT I'TTTTTGCTG ACCTTCTCCT 

TATTATCACC CTGCTCTCCT ACCGCATTCC TTGTGCTGAG ATAATGAAAA TAATATCAAT 
AAAAACTTGA NGGAACTCGG AGACCACTAC GTCGATACA 

(2) INFORMATION FOR SEQ ID NO: 241: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 771 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 



TGTATCGACG TAGTGGTCTC CACTCCCGCC TTGACGGGGC TGCTATCTGC C^rTCCAGGCC 60 

ACTGTCACGG CTCCCGGGTA GAAGTCACTT ATGAGACACA CCAGTGTGGC CTTGTTGGCT 12 0 

TGAAGCTCCT CAGAGGAGGG TGGGAACAGA GTGACCGAGG GGGCAGCCTT GGGCTGACCT 180 

AGGACGGTCA GCTTGGTCCC TCCGCCAAAC ACGAGAGTGC TGCTGCTTGT ATATGAGCTG 24 0 

CAGTAATAAT CAGCCTCGTC CTCAGCCTGG AGCCCAGAGA TGGTCAGGGA GGCCGTGTTG 3 00 

CCANACTTGG AGCCAGAGAA GCGATTAGAA ACCCCTGAGG GCCGATTACC GACCTCATAA 36 0 

ATCATGAATT TGGGGGCTTT GCCTGGGTGC TGTTGGTACC ANGAGACATT ATTATAACCA 42 0 

CCAACGTCAC TGCTGGTTCC ANTGCAGGGA AAATGGTTGA TCNAACTGTC CAAGAAAACC 480 

ACTACGTCCA TACCAATCCA CTAATTGCCN GCCGCCTGCA GGTTCAACCA TATTGGGGAA 54 0 

NAACTCCCCN CCGCCGTTTG GGATTGNCAT NAACCTTTGA AATTTTTTCC TATTANTTGT 600 

CCCCCTAAAA TAAACCNTTG GGCNTTAATC CATTGGGTCC ATANCTTNTT TNCCCGGTTT 660 

TTAAAANTTG TTTATCCCGC CNCCCNATTT CCCCCCCAAC TTTCCAAAAC CCGAAACCNT 72 0 

TNAAATTTNT TNAAACCCTG GGGGGTTCCC NNAATTNNAN TTNAANCTNC C 771 

(2) INFORMATION FOR SEQ ID NO:242: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 167 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vij ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 

TGGGCACCTT CAATATCGGG CTCATCGATA ACATCACGCT GCTGATGCTG CTGTTGCTGG 6 0 

TCCTCTCTAG GAACCTCTGG ATTTTCAAAT TCTTTGAGGA ATTCATCCAA ATTATCTGCC 12 0 

TCTCCTCCTT TCCTCCTTTT TCTAAGGTCT tCTGGTACAA GCGGTCA 167 

(2) INFORMATION FOR SEQ ID NO:243: 

(i) SEQUENCE CHARACTERISTICS: - 

(A) LENGTH: 3 3 8 ' ISase - pai rs ' ■. - 

' .-; (B) TYPE : /■ nuGleid'^aGid'' ' " - 

(C) STRANDEDNESS.: single 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: ' ••• 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:243: ■ 

TTGGGCACCT TCAATATCTA CTGATCTAAA TAGTGTGGTT TGAGGCCTCT TGTTCCTGGC 6 0 

TAAAAATCCT TGGCAAGAGT CAATCTCCAC TTTACAATAG AGGTAAAAAT CTTACAATGG 12 0 

ATATTCTTGA CAAAGCTAGC ATAGAGACAG CAATTTTACA CAAGGTATTT TTCACCTGTT 18 0 

TAATAACAGT GGTTTTCCTA CACCCATAGG GTGCCACCAA GGGAGGAGTG' CACAGTTGCA 24 0 

GAAACAAATT AAGATACTGA AGACAACACT ACTTACCATT TCCCGTATAG CTAACCACCA 3 00 

GTTCAACTGT ACATGTATGT TCTTATGGGC AATCAAGA 33 8 

(2) INFORMATION FOR SEQ ID NO:244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 346 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS- single 
. (D.) TOPOLOGY: linear . 

(ii) MOLECULE TYPE.: cDNA ■ 

(vi) ORIGINAL SOURCE : : I 

(A) ORGANISM : Momo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:244: 
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TTTTTGGCTC CCATACAGCA CACTCTCATG GGAAATGTCT GTTCTAAGGT CAACCCATAA 6 0 

TGCAAAAATC ATCAATATAC TTGAAGATCC CCGTGTAAGG TACAATGTAT TTAATATTAT 12 0 

CACTGATACA ATTGATCCAA TACCAGTTTT AGTCTGGGAT TGAATCA^AAT CACTGTTTTT 180 

GTTGTATAAA AAGAGAAATA TTTAGCTTAT ATTTAAGTAC CATATTGTAA GAAAAAAGAT 24 0 

GCTTATCTTT ACATGCTAAA ATCATGATCT GTACATTGGT GCAGTGAATA TTACTGTAAA 3 00 

AGGGAAGAAG GAATGAAGAC GAGCTAAGGA TATTGAAGGT GCCCAA 34 6 

(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(XI ; SEQUENCE DESCRIPTION: SEQ ID NO: 245: 

ACCAATCCCA CACGGATACT GAGGGACAAG TATATCATCC CATTTCATCC CTACAGCAGC 6 0 

AACTTCATGA GGCaGGAGTT ATTAGTCCCA TTTTACAGAA GAGGAAACTG AGACTTAGGG 12 0 

AGATCAAGTA ArTTGCCCAG GTCGCACAAT TAGTGATAGA GCCAGGGCTT GA/vGCGACGT 180 

CTGTCTTAAG CCAATGACCC CTGCAGATTA TTAGAGCAAC TGTTCTCCAC AACAGTGTAA 24 0 

GCCTCTTGCT ANAA.GCTCAG GTCCACAAGG GCAGAGATTT TTGTCTGTTT TGCTCATTGC 3 00 

TCCTTCCCCA TTGCTTAGAG CAGGGTCTGC CACGAANCAG GTTCTCAATG CATAGTTATT 3 60 

AAATGTATAT AAGAGCAAAC ATATGTTACA GAGAACTTTC TGTATGCTTG TCACTTACAT 420 

GAATCACCTG TGANATGGGT ATGCTTGTTC CCCAJ^TGTTG CAGATNAAGA TATTGAAI^IGT 4 80 

GCCCAAATCA CTAI^JTTGCGG GCGCCTGCAN GTCCANCATA T 521 

(2) INFORMATION FOR SEQ ID NO: 24 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 82 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 6: 

TGGAACCAAT CCAAATACCC ATCAA.TGATA GACTGGATAA AGAAAATTTG GCACATGTTC 6 0 

ACCATGAAAT ACTATGCAGC CATAAAAAAG GATGAGTTCA TATCCTTTGC AGGGACATGG 12 0 

ATGAAGCTGG AGACCATCAT TCTCAGCAAA CTAACAAGGG AACAGAAAAC CAAACACTGC 180 

ATGTTCTCAC TCTTAAGTGG GAGCTGAACA ATGAGAACAC ATGGACACAG GGAGGGGAAC 24 0 

ATCACACAGT GGGGCCTGCT GGTGGGTAGG GGTCTAGGGG AGGGATAGCA TTAGGAGAAA 3 00 

TACCTAATGT AGATGACGGG TTGATGGGTG CAGCAAACCA CCATGACACG TGTATACCTA 3 60 

TGTAACAAAC CTGCATGTTC TGCACATGTA CCCCAGAACT TAAAGTGTTA ATAAPJ^JKAAT 42 0 

TAAGAAAAAA GTTAAGTATG TCATAGATAC ATAAAATATT GTANATATTG AAGGTGCCCA 4 80 
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(2) INFORMATION FOR SEQ ID NO: 24 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(vi) ORIGINAL SOURCE: 

(A) ORGAJaiSM: Homo sapiens 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 7: 

TTCGATACAG GCACAGAGTA AGCAGAAAAA TGGCTGTGGT TTAACCAAGT GAGTACAGTT 6 0 

AAGTGAGAGA GGGGCAGAGA AGACAAGGGC ATATGCAGGG GGTGATTATA ACAGGTGGTT 12 0 

GTGCTGGG;^-A GTGAGGGTAC TCGGGGATGA GGAACAGTGA AAAAGTGGCA AAAAGTGGTA 180 

AGATCAGTGA ATTGTACTTC TCCAGAATTT GATTTCTGGN GGAGTCAAAT AACTATCCAG 24 0 

TTTGGGGTAT CATANGGCAA CAGTTGAGGT ATAGGAGGTA GAAGTCNCAG TGGGATAATT 3 00 

GAGGTTATGA ANGGTTTGGT ACTGACTGGT ACTGACAANG TCTGGGTTAT GACCATGGGA 3 60 

ATGAATGACT GTANitiJVGCGT ANAGGATGAA ACTATTCCAC GANAAAGGGG TCCNAAAACT 42 0 

AAAAANNNAA GNNNNNGGGG AATATTATTT ATGtGGATAT TGAANQTGCC CAAA 4 74 

(2) INFORMATION FOR SEQ ID NO: 24 8: 



' (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3'§5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO;248: 

TTCGATACAG GCAAACATGA ACTGCAGGAG GGTGGTGACG AtCATGATGT TGCCGATGGT 6 0 

CCGGATGGNC ACGAAGACGC ACTGGANCAC GTGCTTACGT CCTTTTGCTC TGTTGATGGC 12 0 

CCTGAGGGGA CGCAGGACCC TTATGACCCT CAGAATCTTC ACAACGGGAG ATGGCACTGG 18 0 

ATTGANTCCC ANTGACACCA GAGACACCCC AACCACCAGN ATATCANTAT ATTGATGTAG 24 0 

TTCCTGTAGA NGGCCCCCTT GTGGAGGAAA GCTCCATNAG TTGGTCATCT TCAACAGGAT 3 00 

CTCAACAGTT TCCGATGGCT GTGATGGGCA TAGTCATANT TAACCNTGTN TCGAA 3 55 

(2) INFORMATION FOR SEQ ID NO: 24 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 34 base pairs 

(B) TYPE t nucleic acid 

(C) STRANDEDNESS: single " . . J. ' . 

(D) TOPOLOGY: linear ' " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:249: 

TTGGATTGGT CCTCCAGGAQ AACAAGGGGA AAAAGGTGAC CGAGGGCT.CC CTGGAACTCA 6 0 

AGGATCTCCA GGAGC'AAAAG GGGATGGGGG AATTCCTGGT CCTGCTGGTC CCTTAGGTCC 12 0 
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ACCTGGTCCT CCAGGCTTAC CAGGTGCTCA AGGCCCAAAG GGTAACAAAG GCTCTACTGG 180 

AGCCGCTGGC CAGAAAGGTG AGAGTGGTCT TCCAGGGCCT CCTGGGGGTC CAGGTCCACC 24 0 

TGGTGAAGTC ATTCAGGCTT TAGCAATGTT GTCCTCCAAA AAAACGAGAA GAGATACTGA 3 00 

AGGCATGGAA GCAGATGCAG ATGATAATAT TCTTGATTAG TCGGATGGAA TGGAAGAAAT 3 60 

ATTTGGTTGG GTCAATTCCC TGAAACAAGA CATCGAGCAT ATGAAATTTC GAATGGGTAC 42 0 

TCAGACCAAT CCAA 434 

(2) INFORMA.TION FOR SEQ ID NO: 2 50: 

(i) SEQUENCE CHARACTER I ST I GS : 

(A) LENGTH: 430 base pairr. 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLiGGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NC:250: 

TGGATTGGTC ACATGGCAGA GACAGGATTC CAAGGCAGTG AGAGGAGGAT ACAATGCTTC 6 0 

TCACTAGTTA TIATTATTTA TTTTATTTTT GAGATGA.AGT CTCGCTTTGT CTCCCAGGCT 12 0 

GGAGAGCGGT GGTGCGATCT TGGCTCTCTG CAACCCCCGC CTCAAGCAAT TCTGCTGTCT 18 0 

TAGCCTCGCG GGTAGATGGA ATTACAGGCG CCCACCGCCA TGCCCAACTA ATTTTTTTGT 24 0 

GTCTTCAGTA GAGACAGGGT TTCGCCATGT TGGGCAGGCT GGTCTTGAAG TCCTGACCTC 3 00 

NAGTGATCTG CCCTCGTCGG CCTCACAAAG TGCTGGA;i.TT ACAGGCATGG GGTGCTGCAC 36 0 

CCAGTCPJKCT TCTCACTAGT TATGGCCTTA TCATTTTCAC CACATTCTAT TGGCCCAAAA 420 

AAA A^^JVAAAN 4 3 q 

(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARAGTERISTIC3 : 

(A) LENGTH: 329 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 251: 

TGGTACTCCA CCATYATGGG GTCAACCGCC ATCCTCGCCC TCCTCCTGGC TGTTCTCCAA 6 0 

GGAGTCTGTG CCGAGGTGCA GCTGRTGCAG TCTGGAGCAG AGGTGAAAAA GTCCGGGGAG 12 0 

TCTCTGAAGA TGTCCTGTAA GGGTTCTGGA TACACCTTTA AGATCTACTG GATCGCCTGG 180 

GTGCGCCAGT TGCCCGGGAA AGGCCTGGAG TGGATGGGGC TCATCTTTCG TGATGACTGT 24 0 

GATACCAGAT ACAGCCCGTC CTTCCAAGGG CAGGTCACCA TCTCAGTCGA TAAGTGCATC 3 00 

AGCACCGCCT ATCTGCAGTG GAGTACCAA 32 9 

(2) INFORMATION FOR SEQ ID NO:252: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SKQ ID NO: 252: 

TGGTACTCCA CTCAGCCCAA CCTTAATT/\A GAJ\TTAAGAG GGAACCTATT ACTATTCTCC 6 0 

CAGGCTCCTC TGCTCTAACC AGGCTTCTGG GACAGTATTA GAAAJ\GGATG TCTCAACAAG 12 0 
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TATGTAGATC 
TTAATGGTCG 
AGAGGAAGAA 
GGCCCCCTTC 
TCCTTGCTTA 

GTTTCTGGTC 



CTGTACTGGC CTAAGAAGTT AAACTGAGAA TAGCATAAAT CAGACCAAAC 

TTGAGACTTG TGTCCTGGAG CAGCTGGGAT AGGAAAACTT TTGGGCAGCA 

CTGCCTGGA-A GGGGGCATCA TGTTAAAAAT TACAAGGGGA ACCCACACCA 

CCAGCTCTCA GCCTAGAGTA TTAGCATTTC TCAGCTAGAG ACTCACAAGT 

GAATGTGCCA CCGGGGGGAG TCCCTGTGGG TGATGAGGGT CTCAAGAGTG 

GCTATCTTCT GTGTGCCCAC AGGAGCCTGG CCCGAGACTT AGCAGGTGAA 

CAGGCTTTGC CCTTGACTCA CTATGTGACC TCTGGTGGAG TACCAA 



480 



420 



536 



240 
30 0 



360 



180 



(2) INFORMATION FOR SEQ ID NO:253: 

( 1 ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 507 base pairs' 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single - ■ ' • 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:253: 

'NTGTTGCGAT CCCAGTAACT CGGGAAGCTG AGGCGGGAGG ATCACCTGAG CTCAGGAGGT 6 0 

TGAGGCCGCA GTGAGCCGGG ACCACGCCAC TACACTCCAG CCTGGGGCAT AGAGTGAGAC 12 0 

CCTCCAAGAC AGAAAAGAAA AGAAAGGAAG GGAAAGGGAA AGGGAAAAGG AAAAGGAAAA 18 0 

GGAAAAGGAA AAGGAAAAGA CAAGACAAAA CAAGACTTGA ATTTCGAT'CT CCTGACTTCA 24 0 

ATTTTATGTT CTTTCTACAC GACT^TTCCT CTGCTTACTA AGATGATAAT TTAGAAACCC 3 00 

CTCGTTCCAT TCTTTACAGC AAGCTGGAAG TTTGGTCAAG TAATTA'CAAT AATAGl-AACA 3 60 

AATTTGAATA TTATATGCCA GGTGTTTTTC ATTCCTGCTC TCACTTAATT ■^CTCACCACTC 42 0 

TGATATAAAT ACAATTGCTG CCGGGTGTGG TGGCTCATGC CTGTAATCCC GGCACTTTGG 4 80 

GAGACCGAGG TGGGCGGATS GCAACAA • 50 7 

(2) INFORMATION FOR SEQ ID NO: 2 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 base pairs 
{B} TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear ' . 

(xi). SEQUENCE DESCRIPTION: SEQ ID NO.:254: 

TTGGATTGGT CACTGTGAGG AAGCCAAATC GGATCCGAGA GTCTTTTTCT AAAGGCCAGT 6 0 

ACTGGCCACA CTTTCTCCTG CCGCCTTCCT CAAAGCTGAA GACACACAGA GCAAGGCGCT 12 0 

TCTGTTTTAC TCCCCAATGG TAACTCCAAA CCATAGATGG TTAGCTNCCC TGCTCATCTT 180 

TCCACATCCC TGCTATTCAG TATAGTCCGT GGACCAATCC AA ' 222 

(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHAJ>ACTERIST1CS': . 

(A) LENGTH: 4 63 base pairs 

(B) TYPE: nucleic acid ■ . 

(C) STRANDEDNESS: single ^ . 

(D) TOPOLOGY: linear - ' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 

TGTTGCGATC CATAAATGCT GAAATGGAAA TAAACAACAT GATGAGGGAG GATTAAGTTG 60 
GGGAGGGAGC ACATTAAGGT GGCCATGAAG TTTGTTGGAA GAAGTGACTT TTGAACAAGG 120 
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CCTTGGTGTT AAGAGCTGAT GAGAGTGTCC GAGACAGAGG GGCCACTGGT ACAATAGACG 

AGATGGGAGA GGGGTTGGAA GGTGTGCGAA ATAGGAAGGA GTTTGTTCTG GTATGAGTGT 

AGTGAACACA GAGGCGAGAG GGCCTGGTGG CTGCAGCTGG AGAGTTATGG AGA.ATAACAT 

TAGGCCCTGT GGGGGAGTGT AGACTGTCAG CAATAATCCA CAGTTTGGAT TTTATTCTAA 

GAGTGATGGG AAGCCGTGGA AAGGGGGTTA AGCAAGGAGT GAAATTATCA GATTTACAGT 

GATAAAAATA AATTGGTCTG GCTACTGGGG AAAAAAAAAA AAA 

(2) INFORMATION FOR SEQ ID NO: 2 56: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 262 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

Ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 



TTGGATTGGT CAA.CCTGCTC AA.CTCTACYT TTCCTCCTTC TTCCTAAAAA ATTAATGAAT 

CCAATACATT PJKTGCCAPJ^A CCCTTGGGTT TTATCAATAT TTCTGTTAAA AAGTATTATC 

CAGAACTGGA CATAATACTA CATAATAATA CATAACAACC CCTTCATCTG GATGCAAACA 

TCTATTAATA TAGCTTAAGA TCACTTTCAC TTTACAGAAG CAACATCCTG TTGATGTTAT 
TTTGATGTTT GGACCAATCC AA 

(2) INFORMATION FOR SEQ ID NO: 257: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 

GNGGNNNNNN NNNCAATTCG ACTCNCTTCC CNTGGTi^vNCC GGTCGACATG GCCGCGGGAT 

TACCGCTTGT NNCTGGGGGT GTATGGGGGA CTATGACCGC TTGTAGCTGG GGGTGTATGG 

GGGACTATGA CCGCTTGTAG MTGGKGGTGT ATGGGGGACT ATGACCGCTT GTCGGGTGGT 

CGGATAAACC GACGCAAGGG ACGTGATCGA AGCTGCGTTC CCGCTCTTTC GCATCGGTAG 

GGATCATGGA CAGCAATATC CGCATTCGYC TGAAGGCGTT CGACCATCGC GTGCTCGATC 

AGGCGACCGG CGACATCGCC GACACCGCAC GCCGTACCGG CGCGCTCATC CGCGGTCCGA 

TCCCGCTTCC CACGCGCATC GAGAAGTTCA CGGTCAACCG TGGCCCGCAC GTCGACAAGA 

AGTCGCGCGA GCAGTTCGAG GTGCGTACCT ACAAGCGGTC A 

(2) INFORMATION FOR SEQ ID NO:258: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 base pairs 

(B) TYPE: nucleic acid 
..(C) STRANDEDNESS: single 
, * (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:258: 



TGACCGCTTG TAGCTGGGGG TGTATGGGGG ACTACGACCG CTTGTAGCTG GGGGTGTATG 
GGGGACTATG ACCGCTTGTA GCTGGGGGTG TATGGGGGAC TATGACCGCT TGTAGCTGGG 
GGTGTATGGG GGACTAGGAC CGCTTGTAGC TGGGGGTGTA TGC^GGACTA TGACCGCTTG 
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TAGCTGGGGG TGTATGGGGG ACTACGAGCG CTTGTAGGTG GGGGTGTATG GGGGACTATG 24 0 

ACCGGTTGTA NCTGGGGGTG TATGGGGGAC TATGACGGGT TGTGCTGCCT GGGGGATGGG 3 00 

AGGAGAGTTG TGGTTGGGGA AAAAAAAAAA AA 3^2 

(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRATJDEDNESS : single 

(D) TOPOLOGY: linear 

(x-i) SEQUENCE DESCRIPTION: SEQ ID NO:259: 

TACCGCTTGT GACCGCTTGT GACCGCTTGT GACCGCTTGT GACCGCTTQT GACCGCTTGT 6 0 

GACCGCTTGT GACCGCTTGT GACCGCTTGT GACCGCTTGT GACCGCTTGT GACCGCTTGT 12 0 

GACCGCTTGT GACCGCTTGT NACNGGGGGT GTCTGGGGGA CTATGANNGA MTGTNACTGG 18 0 

GGGTGTCTGG GGGNCTATGA t^GAJSTTGTNA CNGGGGGTGT CTGGGGGAdT ATGAMnIGACT 24 0 

GTGCNNCCTG GGGGATCNGA GGAGANTNGN GGNTAGNGAT GGTTNGGGAN A 2 91 

(2) INFORMATION FOR SEQ I D - NO: 2 6 0 :' ' ' " 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 238 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single' 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 

TAAGAGGGTA CTGGTTAAAA TACAGGAAAT CTGGGGTAAT GAGGCAGAGA ACCAGGATAC 6 0 

TTTGAGCTCA GGGATGAAAA CTAGAATTTT tTTCTTTTTT TTTGCCTGAG AAACTTGCTG 12 0 

CTCTGAAGAG GCCCATGTAT TAATTGCTTT GATCTTCCTT TTCTTACAGC CCTTTCAAGG 18 0 

GCAGAGCCCT CCTTATCCTG AAGGAXTCTT ATCCTTAGCT ATAGTATGTA CCCTCTTA 23 8 



(2) INFORMATION FOR SEQ ID NO:261: 

' ' (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 746 base pairs " " ' 

'(B) TYPE: nucleic aci<i 
iC) STRANDEDNESS": single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:261: 

TTGGGCACCT TCAATATCAA TAGCTAACAT TTATTGAGTG TTTATCGTAT CATAAAACAC 6 0 

TGTTCTAAGC CTTTAJVACGT ACTAATTCAT TTAATGCTCA TAATCACTTT AGAAGGTGGG 12 0 

TACTAGTATT AGTCTCATTT ACAGATGCAA CATGCAGGCA CAGAGAGGTT AATTAACTTG 180 

CCCAAGGTAA CACAGCTAAG AAATAGAAAA AATATTGAAT CTGGAAAGTT GGGCTTCTGG 24 0 

GTAACCCACA GAGTCTTCAA TGAGCCTGGG GCCTCACTCA GTTTGCTTTT ACAAAGCGAA 3 00 

TGAGTAACAT CACTTAATTC AGTGAGTAGG CCAAATGGAG GTCAGCTACG AGTTTCTGCT 3 60 

GTTCTTGCAG TGGACTGACA GATGTTTACA ACGTCTGGCC ATCAGTWAAT GGACTGATTA 420 

TCATTGGGAW GTGGGTGGGC TGAATGTTGG CCAGTGAAGT TTATTCAWGC CATATTTTTA 4 80 

TGTTTAGGAT GACTTTTGGC TGGTCCTAGG GCAAGCTCTG TCTGSCACGG AACACAGAAT 54 0 

WACACAGGGA- CCCCCTCAAT TTCTGGTGTG GCTAGAACCA TGAACCACTG GTTGGGGGAA 600 
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CAAGCGGTCA AA.ACCTAAGT GCGGCCGGCT GGCAGGGTCC ACCCATATGG GGAAAACTCC 6 60 

CNACGCGTTT GGAATGCCTN AGCTNGAA.TT ATTCTAANAG TTGTCCNCNT AAAATTAGCC 72 0 

TGGGCGTTAA TCAJ^GGGTCN NAAGCC 74 6 

(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: Sas base pairs 

(B) TYPE: nucleic acid 
iC) STRANDEDNESS : single 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262; 



TCACCGCTTG 


TCATCTCACA 


TOGGGTCCTG 


CACOCTTTTG 


CCTTTGTAGG 


A..AACCTGACA 


60 


TTTGTCTGTT 


TCTTCTTTCT 


CTTTTCCTTC 


CCATATCCTC 


CTAATTTACG 


TTTGACTTGT 


120 


TTGCTGAGGA 


GGCAGGAGCT 


AGAGACTGCT 


GTGAGCTCAT 


AGGGGTGGGA 


AGTTTATCCT 


180 


TCAAGTCCCG 


CCCACTCATC 


ACTGCTTCTC 


ACCTTCCCCT 


GACCAGGCTT 


ACAAGTGGGT 


240 


TCTTGCCTGC 


TTTCCCTTTG 


GACCCAACAA 


■^CCCCTCTAA 


TGAGTGTGCA 


TGACTCTGAC 


300 


AGCTGTGGAC 


TCAGGGTCCT 


TGGCTACAGC 


TGCCATGTAA 


AATATCTCAT 


CCAGTTCTCG 


360 


CAAATTGTTA 


AAATAACCAC 


ATTTCTTAGA 


TTCCAGTACC 


CAAATCATGT 


CTTTACGAAC 


420 


TGCTCCTCAC 


ACCCAGAAGT 


GGCACAATA.-^. 


TTCTTGGGGA 


ATTATTACTT 


TTTTTTTTCT 


4 80 


CTCTNTTMNC 


GNNNGNNNNG 


GNNNGNCCAG 


GAATTACCAC 


NTTGGAAGAC 


CTGGCCNGAA 


S40 


TTTATTATAN 


AGGGGAGCCG 


ATTNTTTTTC 


CTAACACAJ^JV 


GCGGGTCA 




588 



(2) INFORMATION FOR SEQ ID NO : 2 6 ? : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRA^JDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 

^^^^^^^^^^ TTTGGCCTGA GCAACTGAAA TTATGAAATT TCCATATACT CAAAAGAGTA 60 

AGACTGCAAA AAGATTAAAT GTAAAAGTTG TCTTGTATAC AGTAATGTTT AAGATACCTA 12 0 

TTANATTTAT AAATGGAAAA TTAGGGCATT TGGATATACA AGTTGAAAAT TCAGGAGTGA 180 

GGTTGGGCTG GCTGGGTATA TACTGAAAAC TGTCAGTACA CAGATGACAT CTAAAACCAC 24 0 

AAATCTGGTT TTATTTTAGC AGTGATATGT GTCACTCCCA CAAAAGCCTT CCCAATTGGC 3 00 

CTCAGCATAC ACAACAAGTC ACCTCCCCAC AGCCCTCTAC ACATAAACAA ATTCCTTAGT 36 0 

TTAGTTCAGG AGGAAATGCG CCCTTTTCCT TCCGCTCTAG GTGACCGCAA 3GCCCAGTTC 4 20 

TCGTCACCAA GATGTTAAGG GAAGTCTGCC AAAGAGGCAT CTGAAAGGAA ATAAGGGGAA 4 80 

TGGGAGTGAC CACAAAGGAA AGCCAAGGAJ^ AAACTTTGGA GACCGTTTCT AGA_NCCCTGG 54 0 

CATTTCACAA CAAAACTCNG GAACAAACCT TGTCTCATCA ATCATTTAAG CCCTTCGTTT 6 00 

GGAJWAGACT TTCTGAACTG GGCGCTGAAC ATAANCCTCA TTGAATGTCT TCACAGTCTC 6 60 

CCAGCTGAAG GCACACCTTG GGCCAGAAGG GGAATCTTCC AGGTCCTCAA NACAGGGCTC 72 0 

GCCCTTTGNC 73 0 

(2) INFORMATION FOR SEQ ID NO:264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 715 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 



(xi) SE'QUENCE DESCRIPTION: SEQ ID NO:264: 



TACACTTAAT 
GTCTCTTTTT 
GATTCTATTT 
TTCTGTCATT 
AACCTTACAT 
TGTNTGTCAT 
TTGCTTAAAA 
TTTATTCTCA 
GCATTGGTTT 
CTGGGAGAGG 
GGCCGGGAAA 



TTTGGCCAGT 
GTGGTTATAG 
TGTTCCTTTT 
TATATAGATT 
TCAATGTGCA 
ATAATGTAAG 
ATTTTTTCCT 
TGTGATCAAT 
AATNNACCTA 
TTTCCGGCTT 
AATTCTCCCA 
TAGAAATTCC 



ATGATAGTCT 
ATGCTGTTGA 
CTCTTCTTTT 
TATCAGCTAT 
TCTTAAACTC 
AATCTACCAC 
TTATATAl'GT 
ATTCCTTCAA 
ATATTTCCTA 
AAGAACCTCC 
AGCTTGGGCC 
AAGTTAACAG 



GTACCACTAT 
GCTTACTTCT 
CCTCCCTTAT 
AJ^CACTTTGT 
ATCACAATCT 
CATATATTTC 
TTTAAAGACA 
NGAAACGTAA 
CCATNTCTNA 
TCTAAAGCAC 
TTNANNTGTA 
GNTANTTTTT 



ATTGAAGCTC 
ACCACCTTGC 
TTTATAATTG 
ATTCTTTTQT 
ATTTTCAAAT 

TAATAGTATA 
AAATTCAAAA 
TACNTTTCAA 
TCTAAGCAGA 
CTCCNTNA1>IG 
NTTTTTNTTN 



TTAGGTCATT 
TATTTCTCCC 

TTTGTGGTTC 
AATATCATAT 
TTCCATCCTA 
TGGGAGGTTT 
TAAATNTCTG 
GAATCTGAAG 
ATTAAGTCTT 
GTTAAAJJTTT 
TCNCC 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
715 



(2) INFORMATION FOR SEQ ID NO: 265: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 152 base pairs 
Jb) TYPE: nucleic acid' 
(C) STRANDEDNESS: single 
(dT TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:265: 

^^^^^^^^^^ TTTCCCAACA CAAAGCACCA TTATCTTTCC TCACAATTTT CAACATAGTT 6 0 

TGATTCCCAT GAAGAGGTTA TGATTTCTAA AGAAAACATG GCTACTATAC TATCAATCAG 12 0 

GGTTAAATCT TTTTTTTTTG AGACGGAGTT TA 152 

(2) INFORMATION FOR SEQ ID NO: 266: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 193 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DiESCRIPtlON : SEQ ID NO: 2 66: 

TAAACTCCGT CCCCTTCTTA ATCAATATGG AGGCTACCCA CTCCACATTA CCTTCTTTTC 6 0 

AAGGGACTGT TTCCGTAACT GTTGTGGGTA TTCACGACCA GGCTTCTAAA CCTCTTAAAA 12 0 

CTCCCCAATT CTGGTGCCAA CTTGGACAAC ATGCTTTTTT XTTTTTTTTT TTTTTTTTTN 180 

GAGACGGAGT TTA 193 

(2) INFORMATION FOR SEQ ID NO: 267: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 460 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 
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TGTTGCGATC CCTTAAGCAT GGGTGCTATT .^A.:^AAJ^TCG TGGAGAAGAA AATACCTGGA 6 0 

ATTTACGTCT TATCTTTAGA GATTGGGAAG ACCGTGATGG AGGACGTGGA GAAGAGGTTC 120 

TTCTTGAATG TC.AA.TTCCCA AGTA.C.CAACA GTGTGTCAGG CACTTGCTAA GGATCCTAAA 180 

TTGCAGC^^G GCTACA/^TGC TATGGGATTC TCCCAGGGAG GCCAATTTCT GAGGGCAGTG 24 0 

GCTCAGAGAT GCCGTTCACC TCCCATGATC AJ\TCTGATCT CGGTTGGGGG ACAACATCAA 3 00 

GGTGTTTTTG GACTCCCTCG ATGGCGAGGA GAGAGCTCTC AGATGTGTGA CTTGATCGGA 3 60 

AAAAGACTGA ATGCTGGGGG GTAGTCCAAA GTTGTTCAGG AACGCCTCGT GCAAGCCGAA 4 20 

TACTGGCATG ACCGATAAAJ^ GGAGGATGTG GATCGCAACA 4 60 

(2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 533 base paix'^ 

{B; TYPE: nucleic acid 
iC) STRAJJDEDNESS : single 
(Di TOPOLOGY: linear 

(XI ) SEOi:z::CE description: SEQ id no : 268: 

TGTTGCGATC CGTTGATAGA ATAGCGACGT GGTAATGAGT GCATGGCACG CCTCCGACTT 6 0 

ACGTTCGCCC GTGGGGACCC CGAGTACGTC TACGGCGTCG TCACTTAGAG TACCCTCTGG 12 0 

ACGCCCGGGC GC'JTTCGATT TACCGGAAGC GCGAGCTGCA GTGGGCTTGC GCCCCCGGCC 18 0 

AAATTCTTTG GGGGGTTT.A.^ GGCCGCGGGG A-^TTTGAGGT ATCTCTATCA GIATGTAGCC 24 0 

AAGTTGGAAC AGTCGCCATT CCCGAAA I'CG CTTTCTTTGA ATCCGCACCG CCTCCAGCAT 3 00 

TGCCTCATTC ATCAACCTGA AGGCACGCAT AAGTGACGGT TGTGTCTTCA GCAGCTCCAC 3 60 

TCCATAACTA GCGCGCTCGA CCTCGTCTTC GTACGCGCCA GGTCCGTGCG TGCGAATTCC 42 0 

CAACTCCGGT GAG7TGCGCA TTTCAAGTTN CGAAACTGTT CGCCTCCACN ATTTGGCATG 4 80 

TTCACGCATG ACACGGi^^ATA AACTCGTCCA GTACCGGGAA TGGGATCGCA ACA 53 3 

(2) INFORMATION FOR SEQ ID NO: 2 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) 3TRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269: 
^^^^^^^^^^ TTCGCCTGAA TTAGCTACAG ATCCTCCTCA C.AAGCGGTCA 5 0 

(2) INFORMATION FOR SEQ ID NO: 2 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 519 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS . single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:270: 

TGTTGCGATC CAA_ATAACCC ACCAGCTTCT TGCACACTTC GCAGAAGCCA CCGTCCTTTG 60 
GCTGAGTCAC GTG;^ACGGTC AGTGCAAGCA GCCGCGTGCC AGAGCAGAGG TGCAGCATGC 120 
TGCACACCAG CTCAGGGCTG ACCTCCTCCA GCAGGATGGA CAGGATGGAG CTGCCGTACG 180 
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TGTCCACCAC 
CAAAAGCGTC 
TCACCAGGAA 
TAATGGGCTC 
CCAGAGTCTG 
CCTTGGGTTG 



CTCCTGGCAC TCTTCCGACA GGGACTTCGG CAGCTTCGAG CACATTTTGT 

GAGTATTTCT TTCTCAGTCT TGTTGTTGTC AATCAGCTTG GTCACCTCCT 

TTCACACACC TCACAGTAAA CATCAGACTT TGCTGGGACC TCGTGCTTCT 

CACCAGTTCC AGGGCAGGGA TGAGATTCTT GGAGGCCACT TTGGCGGGGA 



CATGGGCATC TCTTTCACCT CATCACAGAA CCCAACCAGC GCACAGATCT 
CATGTGCATC ATCATCTGGG ATCGCAACA 



300 
360 
420 
480 



240 



519 



(2) INFORMATION FOR SEQ ID NO: 2 71:. 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 57 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single • 

(D) TOPOLOGY: linear 

^(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:271: 

TTTTTTTTTT TTCGGGCGGC GACCGGACGT GCACTCCTCC AGTAGCGGCT GCACGTCGTG 6 0 

CCAATGGCCC GCTATGAGGA GGTGAGCGTG TCCGGCTTCG AGGAGTTCGA CCGGGCCGTG 12 0 

GAACAGCACA ATGGCAAGAC CATTTTCGCC TACTTTACGG GTTCTAAGGA CGCCGGGGGG 180 

AAAAGCTGGT GCCCCGACTG CGTGCAGGCT GAACCAGTCG TACGAGAGGG GCTGAAGCAC 24 0 

ATTAGTGAAG GATGTGTGTT CATCTACTGC CAAGTAGGAG AAGAGCGTTA TTGGAAAGAT 3 00 

CCAAATAATG ACTTCAGAAA AAACTTQAAA GTAACAGCAG iTGCCTACACT ACTTAAGTAT 3 60 

GGAACACCTC AAAAA.CTGGT :AGAATCTGAG :,TGTCTTCAGG CCAACCTGGT GGAAATGTTG 420 

TTCTCTGAAG ATTAAGATTT TAGGATGGCA - ATGAAGA 4 57 

(2) I-NFOR^4ATION FOR SEQ ID NO: 272: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:272: 

TTTTTTTTTT TTGGGCAACA ACCTGAATAC CTTTTCAAGG CTCTGGCTTG GGCTCAAGCC 60 
CGCAGGGGAA ATGCAACTGG CCAGGTCACA GGGCAATCAA GA 102 

(2) INFORMATION FOR SEQ ID NO: 273: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 455 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

Ixi) SEQUENCE DESCRIPTION: SEQ ID NQ:273: 

^^^^^^^^^^ TTGGCAATCA ACAGGTTTAA GTCTTCGGCC GAAGTTAATC TCGTGTTTTT 60 

GGCAATCAAC AGGTTTAAGT CTTCGGCCGA AGTTAATGTC GTGTTTTTGG CAATCAACAG 12 0 

GTTTAAGTCT TCGGCCGAAG TTAATCTCGT GTTTTTGGCA ATCAACAGGT TTAAGTCTTC 180 

GGCCGAAGTT AATCTCGTGT TTTTGGCAAT CAACAGGTTT AAGTCTTCGG CCGAAGTTAA 24 0 

TCTCGTGTTT TTGGCAATCA ACAGGTTTAA GTCTTCGGCC GAAGTTAATC TCGTGTTTTT 3 00 

GGCAATCAAG AGGTTTAAGT. CTTCGGCCGA AG.TTAATCTC GTGTTTTTGG CAATCAACAG 3 60 
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GTTTAAGTCT TCGGCCGAAN TTAATCTCGT GTTTTTGGCA ATCAACAGGT TTAANTCTTC 42 0 

GGCCGAAGTT A.a.TCTCGTGT TTTTGGCAAT CAA1>JA 455 

(2) INFORMATION FOR SEQ ID NO: 2 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 

TTTTTTTTTT TTGGCCAATA CCCTTGATGA ACATCAATGT GAAAATCCTC GGTAAAATAC 6 0 

TGGCAAACCA AATCCAGCAG CACATCAAAA AGCTTATCCA CCATGATCAA GTGGGCTTCA 12 0 

TCCCTGGGAT GCAAGGCTGG TTCAACATAA GAAAATCAAT AAATGTAATC CATCACATAA 18 0 

ACAGAACCAA AGACAAAAAC CACATGATTA TCTC-WTAGA TGCAGAAAAG GCCTTGGACA 24 0 

AATTCAACAG CCCTTCATGC TAAACACTCT TAATAAACTA GATATTGATG GAATGTATCT 3 00 

CAAAATAATA AGAGCTATTT ATGACAAACC CACAOCCAAT ATCATACTGA ATGGGCAAAG 36 0 

ACTGGAAGCA TTCCCTTTGA AAACTGGCAC .AAGACA.AGGA TGCCCTCTCT CACCGCTCCT 420 

ATTCAACATA GTATTGGAAG TTCTGGCCAG GGCAATCAAG A 4 61 

(2) INFORMATION FOR SEQ ID N0.27E: 

( .1 ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 729 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 

TTTTTTTTTT TTGGCCAACA CCAAGTCTTC CACGTGGGAG GTTTTATTAT GTTTTACAAC 6 0 

CATGAAAACA TAGGAAGGTG GCTGTTACAG CAAACATTTC AGATAGACGA ATCGGCCAAG 12 0 

CTCCCCAAAC CCCACCTTCA CAGCCTCTTC CACACGTCTC CCANAGATTG TTGTCCTTCA 180 

CTTGCAAATT CANGGATGTT GGAAGTNGAC ATTTNNAGTN GCNGGAACCC CATCAGTGAA 240 

NCANTAAGCA GAANTACGAT GACTTTGANA NACANCTGAT GAAGAACACN CTACNGANAA 3 00 

CCCTTTCTNT CGTGTTANGA TCTCNNGTCC NTCACTAATG CGGCCCCCTG CNGGTCCACC 3 60 

ATTTGGGAGA ACTCCCCCCN CGTTGGATCC CCCCTTGAGT NTCCCATTCT NGTCCCCCAN 4 20 

ACCNGNCTTG NGNGNCANTN CNNCCTCNCA CCNTGTTTCC CTGNNGTNAA AATNNGTTTT 4 80 

NCCGCCNCCC NAATTCCCAC CCNAATCACA GCGAANCCNG AAGGCCTTCN NAAGTGTTTA 54 0 

ANGCCCNGNG GTTTCCTCNT NTANTTGCAG CCTACCCTCC CNCTTNMNNT TNCGNGTTGG 600 

TCGCGCCCTG GNCNCGCCTN GTTCCTCTTT NNGGNNACAA CCTNGNTCNN NGGCNCNTCN 6 60 

NNNCTNTTCC TNNNACTAGC TNGCCTNTCC NCNCCGNGGN NCANNGCACA TTNCNCNNAC 720 
TNTGTNNCC 



(2) INFORMATION FOR SEQ ID NO: 2 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 339 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276: 



729 
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TGACCTGACA TGTAGTAGAT ACTTAATAAA TATTTGTGGA ATGAATGGAT GAAGTGGAGT 6 0 

TACAGAGAAA A7vTAGAAAAG TACAAATTGT TGTCAGTGTT TTGAAGGAAA ATTATGATCT 12 0 

TTCCCAAAGT TCTGACTTCA TTCTAAGACA GGGTTAGTAT CTCCATACAT AATTTTACTT 180 

GCTTTTGAAA ATCAAATGAG ATAATCTATT TAGATTGATA ATTTATTTAG ACTGGCTATA 24 0 

AACTATTAAG TGCTAGCAAA TATACATTTT AATCTCATTT TCCACCTCTT GTGATATAGC 3 00 

TATGTAGGTG TTGACTTTAA TGGATGTCAG GTCAATCCC 33 9 

(2) INFORMATION FOR SEQ ID NO: 277: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 664 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear ^ ' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:277: 

TGACCTGACA TCCATAACAA AATCTTTCTC CATTATATTC TTCTAGGGGA ATTTCTTGAA 6 0 

AAGCATCCAA AGGAAACf-AA TGATGGTAAG ACCGTGCCAA GTGGGGAGCA GACACCAAAG 12 0 

TAAGACCACA GATTTTACAT ' TCAACAGGTA GCTGACAGTA CTTTGCCCGA CACTGTGGGC 180 

AGAAATAGCC TCCTAATGTA AGCCCTGGCT CAGTATTGCC ATCCAAATGC GCCATGCTGA 24 0 

AAGAGGGTTT TGCATCCTGG TCAGATNAAG AAGCAATGGT ^GTGCTGAGGA AATCCCATAC 3 00 

GAATAAGTGA GCATTCAGAA CTTGAGCTAG CAGGAGGAGG ACTAAGATGA TGTGTGAGCA 3 60 

ACTCTTTGTA ATGGCTTTCA TCTAAAATAA CATGCJTACGT GCCACCAGTT TCACGAGCAA 42 0 

GTACAGTGCA AACGCGAA.CT TCTGCAGACA ATCCAATAAC AGATACTCTA 'aTTTTAGCTG 4 30 

CCTTTAGGGT CTTGATT^J\A TCATAAATAT TAGATGGATC GCAAGTTGTA AGGNTGCTAA 54 0 

AAGATGATTA GTACTTCTCG ACTTGTATGT CCAGGCATGT TGTTTTAAAN TCTGCCTTAG 6 00 

NCCCTGCTTA GGGGAATTTT TAAAGAAGAT GGCTCTCCAT GTTCANGGTC AATCACNAAT 660 

TGCC 664 

(2) INFOR^4ATION FOR SEQ ID NO:278-: 

(i) SEQUENCE .CHARACTERISTICS : - ' 

(A) LENGTH.: 4 52 base pairs 

(B) TYPE: . nucleic acid>' 

(C) STRAIJDEDNESS :. single ' -'^ 

(D) TOPOLOGY: linear ' '.. - • - •■ " - 

(xi) SEQUENCE DESCRI'P.TION: SEQ ID NO:278: - ' ' 

TGACCTGACA TTGAGGAAOA CCACACACGT CTGAAATTCC TTAGGTTCAG AAGGGCATTT 6 0 

GACACAGAGT GGGCCTGTGA TAATTCATGA AATGCATTCT GAAGTCATCC AGAATGGAGG 12 0 

CTGCAATCTG CTGTGCTTTG GGGGTTGCCT CACTGTGCTC CTGGATATCA CACAAAAGCT 180 

GCAATCCTTC TTCTTCAACT AACATTTTGC AGTATTTGCT GGGATTTTTA CTGCAGACAT 24 0 

GATACATAGC CCATAGTGCC CAGAGCTGAA CCTCTGGTTG AGAGAAGTTG CCAAGGAGCG 300 

GGAAAAATGT CTTGAAAGAT CTATAGGTCA CCAATGCTGT GATCTTACAA CTTGAACTTG 36 0 

GCCAATTCTG TATGGTTGCA TGCAGATCTT GG AGAAGAGT ACGCCTCTGG AAGTCACGGG 42 0 

ATATCCAAAN CTGTCTGTCA GATGTCAGGT ca 4 52 

(2) INFORMATION FOR SEQ ID NO: 279: . - . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 274 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279: 



TTTTTTTTTT TTCGGCAAGG CAAATTTACT TCTGCAAAAG GGTGCTGCTT GCACTTTTGG 6 0 

CCACTGCGAG AGCACACCAA ACAAAGTAGG GAAGGGGTTT TTATCCCTAA CGCGGTTATT 120 

CCCTGGTTCT GTGTCGTGTC CCCATTGGCT GGAGTCAGAC TGCACAATCT ACACTGACCC 180 

AACTGGCTAC TGTTTAAAAT TGAATATGAA TAATTAGGTA GGAAGGGGGA GGCTGTTTGT 24 0 

TACGGTACAA GACGTGTTTG GGCATGTCAG GTCA 2 74 



(2) INFORMATION FOR SEQ ID NO: 2 80: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 272 base pairs 

(B) TYPE:- nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 80: 



TACCTGACAT GGAG;^AAT/V^ CTTGTAGTAT :^TTGCGTGCA ATGG.AATACT ATATGAGGGT 6 0 

GAAAATGAAT GAACTAGCAA TGCGTGTATC AACATGAATA AATCCCCAAA ACATAATAAT 12 0 

GTTGAATGGA AAAGGTCAGT TTCAGAACGA TATATATGCC CTCTAAATCC ATTTATGTAA. 180 

ACCTTTAAAA AACT.-.CATTA TTTATGGTCA TAAGTCCATC CAGAAAATAT TTAAAAACCT 24 0 

ACATGGGATT GAT;iACTACT GATGTCAGGT CA 2 72 



(2) INFORMATION FOR SEQ ID NO: 2 81: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 31 base pairs 
(E) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 



TTTTTTTTTT TTGGCCAATA GCATGATTTA AACATTGGAA AAAGTCAAAT GAGCAATGCG 60 

AATTTTTATG TTCTCTTGAA TAATCAAAAG AGTAGGCAAC ATTGGTTCCT CATTCTTGJ^A 120 

TAGCATTAAT CAGAAAATAT TGCATAGCCT CTAGCCTCCT TAGAGTAGGT GTGCTCTCTC 18 0 

AAATATATCA TAGTCCCACA GTTTATTTCA TGTATATTTT CTGCCTGAAT CACATAGACA 24 0 

TTTGAATTTG CPxACGCCTGA TGTAAATATA TAAATTCTTA CCAATCAGAA ACATAGCAAG 3 00 

AAATTCAGGG ACTTGGTCAT YATCAGGGTA TGACAGCATJA TCCCTGTARA AACACTGATA 3 60 

CACACTCACA CACGTATGCA ACGTGGAGAT GTCGCYTTWW KKKTWYWCWM RMRYCRWCGN 4 20 

AATCACTTAN N 431 



(2) INFORMATION FOR SEQ ID NO:282: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:282: 
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ATTCGATTCG ATGCTTGAGC CCAGGAGTTC AAGACTGCAG TGAGCCACTG CACTTCAGGC 6 0 

TGGACAACAG AGCGAGTCCC TGTGCCAAAA AAAAAAAA 9 8 

(2) INFORMATION FOR SEQ ID NO: 2 83: 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 764 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283: 





TTCGCAAGCA 


CGTGCACTTT 


attgaatgac 


actgtagaca 


GGTGTGTGGG 


60 


TATAAACTGC 


TGTATCTAGG 


GGCAGGACCA 


agggggcagg 


ggcaacagcc 


CCAGCGTGCA 


120 


GGGCCA5CAT 


TGCACAGTGG 


ASTGCAAAGG 


ttgcaggcta 


tgggcggcta 


CTAVTAACCC 


180 


CGTTTTTCCT 


GTATTATCTG 


TAACATAATA 


tggtagactg 


tcacagagcc 


GAATWCCART 


240 


HACASGATGA 


ATCCAAWGGT 


CAYGAGGATG 


CCCASAATCA 


GGGCCCASAT 


STTCAGGCAC 


300 


TTGGCGGTGG 


GGGCATASGC 


CTGKGCCCCG 


GTCACGTCSC 


caaccwtcty 


CCTGTCCCTA 


360 


CMCTTGAWTC 


CNCNCCTTNN 


NNTNCCNTNA 


TNTGCCCGCC 


CNCCTCCTNG 


NGTCAACCNG 


420 


NATCTGCACT 


ANCTCCCTCN' CCC'CTTNTGG'^J^TCTCNTee 


TTCAANTAAN 


NTTATCCTTN 


480 


ACNCCCCCCT 


CNCCTTTCCe 


CTNCCNCCCN 


tnatcccngn 


MCCNCTATCA 


NTCNTNCCCT 


540 


CNCTNTNCTN 


CNN ATCGTTC 


cncctnntaa 


CTACNCTTTN 


NACNANNCCT 


CACTNATNCC 


600 


NGNNANTTCT 


TTCCTTCCCT 


CCCNACGCNN tGCGTGCGCC 


CGTCTNGCCT 


mraCTNCGNA 


660 


CCCNNACTTT 


ATTTACCTTT 


ncaccctagc 


nctctacttn 


ACCCANCCNC 


TCCTACCTCC 


720 


NGGNCCACCC 


NNCCCTNATC 


nctnnctctn 


TCNNCTCNTT 


CCCC 




764 



(2) INFORMATION FOR SEQ ID NO: 2 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid - - : 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284: 

CAAGTGTAGG^ CACAGTGATG AAAGCCTGGA GCAAACACAA TCTGTGGGTA ATTAACGTTT 6 0 

ATTTCTCCCC TTCCAGGAAC GTCTTGCATG' GATGATCAAA- GATCAGCTCC TGGTCAACAT 12 0 

AAATAAGCTA GTTTAAGATA CGTTCCCCTA CACTTGA 157 

(2) INFORMATION FOR SEQ ID NO: 2 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 285: 

ATTCGATTGT ACTCAGACAA CAATATGCTA AGTGGAAGAA GTCAGTCACA AAAGACCACA 6 0 

TACTGTATGA CTTCATTTAC ATTAAGTGTC CAGAATAGGC AAATCCGTAG AGACAGAAAG 120 
TAGATGAGCA GCTGCCTAGG TCTGAGTACA 150 
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(2) INFORMATION FOR SEQ ID NO: 286: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIJDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: 

ATTCGATTTT TTTTTTTTTG GCCATGATGA AATTCTTACT CCCTCAGATT TTTTGTCTGG 6 0 

ATAAATGCAA GTCTCACCAC CAGATGTGAA ATTACAGTAA ACTTTGAAGG AATCTCCTGA 12 0 

GCAACCTTGG TTAGGATCAA TCCAATATTC ACCATCTGGG AAGTCAGGAT GGCTGAGTTG 18 0 

CAGGTCTTTA CAAGTTCGGG CTGGATTGGT CTGAGTACA 219 

(2) INFORf^TION FOR SEQ ID NO: 2 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 196 base pairs 

(B) TYPE: nucleic acid 

(C) STRAJsfDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 87: 

ATTCGATTCT TGAGGCTACC AGGAGCTAGG AGAACAGGCA TGGAACAAAT TTTCCCTCAT 6 0 

ATCCATACTC AGAAGGA.ACC AACCCTGCTG ACACCTTAAT TTCAGCTTCT GGCCTCTAGA 12 0 

ACTGTGAGAG AGTACATTTC TCTTGGTTTA AGCCAACAGA ATCTGTCTTT TGGTACTTTA 18 0 

TATCATAGCC TCAAGA -^^^ 

(2) INFORMATION FOR SEQ ID NO: 2 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 199 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 8: 

ATTCGATTTC AGTCCAGTCC CAGAACCCAC ATTGTCAATT ACTACTCTGT ARAAGATTCA 6 0 

TTTGTTGAAA TTCATTGAGT AAAACATTTA TGATCCCTTA ATATATGCCA ATTACCATGC 12 0 

TAGGTACTGA AGATTCAAGT GACCGAGATG CTAGCCCTTG GGTTCAAGTG ATCCCTCTCC 18 0 

CAGAGTGCAC TGGACTGAA 19 9 

(2) INFORMATION FOR SEQ ID NO: 2 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 base pairs 

(B) TYPE: nucleic acid 

(C) STRAI>nDEDNESS : single 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 89: 
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ATTCGATTCT TGAGGCTACA AACCTGTACA GTATGTTACT CTACTGAATA CTGTAGGCAA 6 0 

TAGTAATACA GAAGCAAGTA TCTGTATATG TAAACATTAA AAAGGTACAG TGAAACTTCA 12 0 

GTATTATAAT CTTAGGGACC ACCATTATAT ATGTGGTCCA TCATTGGCCA AAAAAAAAAA 18 0 

^ 182 

(2) INFORMATION FOR SEQ ID NO: 2 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1646 base pairs 

(B) TYPE: nucleic acid 
iC) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

Ui) SEQUENCE DESCRIPTION: SEQ' ID NO : 290- 

GGCACGAGGA GAAATGTAAT TCCATATTTT ATTTGAAACT TATTCCATAT TTTAATTGGA 6 0 

TATTGAGTGA TTGGGTTATC AAACACCCAC AAACTTTAAT TTTGTTAAAT TTATATGGCT 12 0 

TTGAAATAGA AGTATAAGTT GCTACCATTT TTTGATAACA TTGAAAGATA' GTATTTTACC 180 

ATCTTTAATC ATCTTGGAAA ATACAAGTCC TGTGAACAAC CACTCTTTCA CCTAGCAGCA 24 0 

TGAGGCCAAA AGTAAAGGCT TTAAATTATA ACATATGGGA TTCTTAGTAG TATGTTTTTT 3 00 

TCTTGAAACT CAGTGGCTCT ATCTAACCTT ' AGTATCTCCT CACTCTTTCT CTAAGACTAA 36 0 

ACTCTAGGCT CTTAAAAATC TGCCCACACC AATCTTAGAA GCTCTGAAAA GAATTTGTCT 42 0 

TTAAATATCT TTTAATAGTA ACATGTATTT TATGGACCAA ATTGACATTT TCGACTATTT 4 80 

TTTCCAAAAA AGTCAGGTGA ATTTCAGCAC. ACTGAGTTGG GAATTTCTTA TCCCAGAAGA 54 0 

CCAACCAATT TCATATTTAT TTAAGATTGA TTCCATACTC CGTTTTCAAG GAGAATCCCT 600 

GCAGTCTCCT TAAAGQTAGA ACAAAXACTT TCTATTTTTT TTTCACCATT GTGGQATTGG 66 0 

ACTTTAAGAG- GTGACTCTiV^ AAAAAGAGAG AACAAATATG TCTCAGTTGT ATTAAQCACG 72 0 

GACCCATATT. ATCATATTCA CTTAAAAAAA TGAtTTCCTG TGCACCTTTT ' GGCAACTTCT 780 

CTTTTCAATG TAGGGAAAAA CTTAGTCACC CTGAAAACCC ACAAAATAAA TAAAACTTGT 84 0 

AGATGTGGGC AGAAGGTTTG GGGGTGGACA TTGTATGTGT TTAAATTAAA CCCTGTATCA 90 0 

CTGAGAAGCT GTTGTATGGG TCAGAGAAAA TGAATCCTTA GAAGCTGTtC ACATCTTCAA 96 0 

GAGCAGAAGC AAACCACATG TCTCAGCTAT ATTATTATTT ATTTTTTATG CATAAAGTGA 102 0 

ATCATTTCTT CTGTATTAAT TTCCAAAGGG TTTTACCCTC TATTTAAATG CTTTGAAAAA 1080 

CAGTGCATTG ACAATGGGTT GATATTTTTC TTTAAAAGAA AAATATAATT ATGAAAGCCA 114 0 

AGATAATCTG AAGCCTGTTT TATTTTAAAA CTTTTTATGT TCTGTGG^TTG ATGTTGTTTG 120 0 

TTTGTTTGTT TCTATTTTGT TGGTTTTTTA CTTTGTTTTT TGTTTTGTTT TGTTTTGTTT 12 6 0 

KGCATACTAC ATGCAGTTCT TTAACCAATG TCTGTTTGGC TAATGTAATT AAAGTTGTTA 13 2 0 

ATTTATATGA GTGCATTTCA ACTATGTCAA TGGTTTCTTA ATATTTATTG TGTAGAAGTA 13 80 

CTGGTAATTT TTTTATTTAC AATATGTTTA AAGAGATAAC AGTTTGATAT GTTTTCATGT 14 4 0 

GTTTATAGCA GAAGTTATTT ATTTCTATGG CATTCCAGCG GATATTTTGG TGTTTGCGAG 150 0 

GCATGCAGTC AATATTTTGT ACAGTTAGTG GACAGTATTC AGC.AACGCCT GATAC^CTTCT 156 0 

TTGGCCTTAT GTTAAATAAA AAGACCTGTT TGGGATGTAT TTTTTATTTT ' TAAAAAAAAA 162 0 

AAAAAAAAAA AAAAAAAAAA AAAAAA 1646 

(2) INFORMATION FOR SEQ ID NO: 2 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1851 base pairs 

(B) TYPE: nucleic acid 
(C; STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 91: 



BNSCXDCfD <WO 9845328A2 I 



wo 98/45328 FCT/US98/06939 

136 



TCATCACCAT TGCCAGCAGC GGCACCGTTA GTCAGGTTTT CTGGGAATCC CACATGAGTA 6 0 

CTTCCGTGTT CTTCATTCTT CTTCAATAGC CATAAATCTT CTAGCTCTGG CTGGCTGTTT 12 0 

TCACTTCCTT TAAGCCTTTG TGACTCTTCC TC7'G;^TGTCA GCTTTAAGTC TTGTTCTGGA 180 

TTGCTGTTTT CAGAAGAGAT TTTTA;i.CATC TCTTTTTCTT TGTAGTCAGA AAGTAACTGG 24 0 

CAAATTACAT GATGATGACT AGAA.^CAGCA TACTCTCTGG CCGTCTTTCC AGATCTTGAG 3 00 

AAGATACATC AAC.ATTTTGC TCAAGT'AGAG GGCTGACTAT ACTTGCTGAT CCACAACATA 3 60 

CAGCAAGTAT GAGAGCAGTT CTTCCATATC TATCCAGCGC ATTTAAATTC GCTTTTTTCT 4 20 

TGATTAAAAA TTTCACCACT TGCTGTTTTT GCTCATGTAT ACCAAGTAGC AGTGGTGTGA 4 80 

GGCCATGCTT GTTTTTTGAT TCGATATCAG CACCGTATAA GAGCAGTGCT TTGGCCATTA 54 0 

ATTTATCTTC ATTGTAGACA GCATAGTGTA GAGTGGTATT TCCATACTCA TCTGGAATAT 6 00 

TTGGATCAGT GCCATGTTCC AGCAACATTA ACGCACATTC ATCTTCCTGG CATTGTACGG 66 0 

CCTTTCTCAG AGCTGTCCTC TTTTTGTTGT CAAGGACATT AAGTTGACAT CGTCTGTCCA 72 0 

GCAGGAGTTT TACTACTTCT GAATTCCCAT TGGCAGAGGC CAGATGTAGA GCAGTCCTCT 7 80 

TTTGCTTGTC CCTCTTGTTC ACATCCGTGT CCCTGAGCAT GA.CGATGAGA TCCTTTCTGG 84 0 

GGACTTTACC CCACCAGGCA GCTCTGTGGA GCTTGTCCAG ATCTTCTCCA TGGACGTGGT 90 0 

ACCTGGGATC CArG.AAGGCG CTGTCATCGT AGTCTCCCCA AGCGACCACG TTGCTCTTGC 96 0 

CGCTCCCCTG CAGCAGGGGA AGCAGTGGCA GCACCACTTG CACCTCTTGC TCCCAAGCGT 1020 

CTTCACAGAG GAGTCGTTGT GGTCTCCAGA AGTGCCCACG TTGCTCTTGC CGCTCCCCCT 108 0 

CTCCATCCAG GGAGGAAGAA ATGCAGGAAA TGAAAGATGC ATGCACGATG GTATACTCCT 114 0 

CAGCCATCAA ACTTCTGGAC AGCAGGTCAC TTCCAGCAAG GTGGAGAAAG CTGTCCACCC 12 00 

ACAGAGGATG AGATCCAGAA ACCACAATAT CCATTCACAA ACAAACACTT TTCAGCCAGA 1260 

CACAGGTACT GJ^J-J^TCATGT CATCTGCGGC A/^.CATGGTGG AJVCCTACCCA ATCACACATC 13 2 0 

AAGAGATG/^A GACACTGCAG TATATCTGCA CAACGTAATA CTCTTCATCC ATAACAAAAT 1380 

AATATAATTT TCCTCTGGAG CCATATGGAT G.\ACTATGAA GGAAGAACTC CCCGAAGAAG 144 0 

CCAGTCGCAG AGAAGCCACA CTGAAGCTCT GTCCTCAGCC ATCAGCGCCA CGGACAGGAP 1500 

TGTGTTTCTT CCCCAGTGAT GCAGCCTCAA GTTATCCCGA AGCTGCCGCA GCACACGGTG 156 0 

GCTCCTGAGA AACACCCCAG CTCTTCCGGT CTAACACAGG CAAGTCAATA AATGTGATAA 162 0 

TCACATAAAC AGAJ=iTTAAAA GCAAAGTCAC ATAAGCATCT CAACAGACAC AGAAAAGGCA 1680 

TTTGACAAAA TCCAGCATCC TTGTATTTAT TGTTGCAGTT CTCAGAGGAA ATGCTTCTAA 174 0 

CTTTTCCCCA TTTAGTATTA TGTTGGCTGT GGGCTTGTCA TAGGTGGTTT TTATTACTTT 1800 

AAGGTATGTC CCTTCTATGC CTGTTTTGCT GAGGGTTTTA ATTCTCGTGC C 1851 



(2) INFORMATION FOR SEQ ID NO: 2 92: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 1 base pairs 

(B) TYPE: nucleic acid 

(C) STRAJ^Ii'EDNESS : Single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: 



TCATCACCAT TGCCAGCAGC GGCACCGTTA GTCAGGTTTT CTGGGAATCC CACATGAGTA 60 

CTTCCGTGTT CTTCATTCTT CTTCAATAGC CATAAATCTT CTAGCTCTGG CTGGCTGTTT 12 0 

TCACTTCCTT TAAGCCTTTG TGACTCTTCC TCTGATGTCA GCTTTTUVGTC TTGTTCTGGA 18 0 

TTGCTGTTTT CAGAAGAGAT TTTTAACATC TGTTTTTCTT TGTAGTCAGA AAGTAACTGG 24 0 

CAAATTACAT GATGATGACT AGAAACAGCA TACTCTCTGG CCGTCTTTCC AGATCTTGAG 3 00 

AAGATACATC AACATTTTGC TCAAGTAGAG GGCTGACTAT ACTTGCTGAT CCACAACATA 36 0 

CAGCAAGTAT GAGAGCAGTT CTTCCATATC TATCCAGCGC ATTTAAATTC GCTTTTTTCT 42 0 

TGATTAAAAA TTTCACCACT TGCTGTTTTT GCTCATGTAT ACCAAGTAGC AGTGGTGTGA 480 

GGCCATGCTT GTTTTTTGAT TCGATATCAG CACCGTATAA GAGCAGTGCT TTGGCCATTA 54 0 

ATTTATCTTC ATTGTAGACA GCATAGTGTA GAGTGGTATT TCCATACTCA TCTGGAATAT 600 

TTGGATCAGT GCCATGITCC AGCAACATTA ACGCACATTC ATCTTCCTGG CATTGTACGG 660 

CCTTTGTCAG AGCTGTCCTC TTTTTGTTGT CAAGGACATT AAGTTGACAT CGTCTGTCCA 72 0 

GCAGGAGTTT TACTACTTCT GAATTCCCAT TGGCAGAGGC CAGATGTAGA GCAGTCCTCT 78 0 
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TTTGCTTGTC CCTCTTGTTC ACATCCGTGT CCCTGAGCAT GACGATGAGA TCCTTTCTGG 84 0 

GGACTTTACC CCACCAGGCA GCT'CTGTGGA GCTTGTCCAG ATCTTCTCCA TGGACGTGGT 90 0 

ACCTGGGATC CATGAAGGCG CTGTCATCGT AGTCTCCCCA AGCGACCACG TTGCTCTTGC 96 0 

CGCTCCCCTG CAGCAGGGGA AGCAGTGGCA GCACCACTTG GACCTCTTGC TCCCAAGCGT 102 0 

CTTCACAGAG GAGTCGTTGT GGTCTCCAGA AGTGCCCACG TTGCTCTTGC CGGTCCCCCT 1080 

GTCCATCCAG GGAGGAAGAA ATGCAGGAAA TGAAAGATGC ATGCACGATG GTATACTCCT 114 0 

CAGCCATCAA ACTTCTGGAC AGCAGGTCAC TTCCAGCAAG GTGGAGAAAG CTGTCCACCC 12 00 

ACAGAGGATG AGATCCAGAA ACCACAATAT CCATTCACAA ACAAACACTT TTCAGCCAGA 12 6 0 

CACAGGTACT GAAATCATGT CATCTGCGGC AACATGGTGG AACCTACCCA ATCACACATC 13 20 

AAGAGATGAA GACACTGCAG TATATGTGCA CAACGTAATA CTCTTC/^TCC ATAACAAAAT 13 80 

AATATAATTT TCCTCTGGAG -CCATATGGAT GAACTATGAA GG AAG AACTC CCCGAAGAAG 14 4 0 

CCAGTCGCAG AGAA:GCCACA CTGAAGCTCT GTCCTCAGCC ATCAGCGCCA CGGACAGGAP. 15 0 0 

TGTGTTTCTT CCCCAGTGAT GCAGCCTCAA GTTATCCCGA AGCTGCCGCA GCACACGGTG 156 0 

GCTCCTGAGA AACACCCCAG CTCTTCCGGT CTAACACAGG CAAGTCAATA AATGTGATAA 162 0 

TCACATAAAC AGAATTAAAA GCAAAGTCAC ATAAGCATCT CAACAGACAC AGAAAAGGCA 168 0 

TTTGACAAAA TCCAGCATCC TTGTATTTAT TGTTGCAGTT CTCAGAGGAA ATGCTTCTAA. 174 0 

CTTTTCCCGA TTTAGTATTA TGTTGGCTGT GGGCTTGTCA TAGGTGGTTT TTATTACTTT 18 00 

AAGGTATGTC CCTTCTATGC CTGTTTTGCT GAGGGTTTTA ATTCTCGTGC C 1851 

(2), INFORMATION FOR SEQ ID NO:293: 

(i) SEQUENCE CIIARACTERISTICS": ' 

(A) LENGTH:* 66 8 ba's:e: pairs ' , ■ 

(B) ; TYPE: .nuclerc acid / \. 

(C) STRANDEDNESS ;. Single :.. 
ID) TOPOLOGY: linear ' ^ . . - ' 

(xi) SEQUENCE DESCRIPTION: - SEQ ID NO:293: 

CTTGAGGTTC CAAATAYGGA AGACTQGCCC TTACACASGT CAATGTTAAA ATGAATGCAT 6 0 

TTCAGTATTT TGAAGATAAA ATTRGTAGAT CTATACCTTG TTTTTTGATT CGATATCAGC 12 0 

ACCRTATAAG AGCAGTGCTT TGGCCATTAA TTTATCTTTC ATTRTAGACA GCRTAGTGYA 18 0 

GAGTGGTATT TCCATACTCA TCTGGAATAT TTGGATCAGT GCCATGTTGC ' AGCAACATTA 24 0 

ACGCACATTC ATCTTCCTGG CATTGTACGG CCTGTCAGTA TTAGACCCAA AAACAAATTA 3 00 

CATATCTTAG GAATTCAAAA TAACATTCCA CAGCTTTCAC CAACTAGTTA TATTTAAAGG 3 60 

AGAAAACTCA TTTTTATGCC ATGTATTGAA ATCAAACCGA CCTCATGCTG ATATAGTTGG 420 

CTACTGCATA CCTTTATCAG AGCTGTCCTC TTTTTGTTGT CAAGGACATT AAGTTGACAT 4 80 

CGTCTGTCCA GCAGGAGTTT TACTACTTCT GAJ^TTCCCAT TGGCAGAGGC CAGATGTAGA 54 0 

GCAGTCCTAT GAGAGTGAGA AGACTTTTTA GGAAATTGTA GTGCACTAGC TACAGCCATA 600 

GCAATGATTC ATGTAACTGC AAACACTGAA TAGCCTGCTA TTACTCTGCC TTCAAAAAAA 66 0 

AAAAAAAA g^g 

(2) INFORMATION FOR SEQ ID NO:294: 

(i)- SEQUENCE CHARACTERISTICS: 

(A) LENGTH: X512 base pairs " . 

(B) TYPE: nucleic acid 

^ - (C) STRANDEDNESS single 
' (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:294 5 : 

GGGTCGCCCA GGGGGSGCGT GGGCTTTCCT CGGGTGGGTG TGGGTTTTGC CTGGOTGGGG 60 

TGGGCTGGGC TRGAATCCCC TGCTGGGGTT GGCAGGTTTT GGCTGGGATT GACTTTTYTC 120 

TTCAAACAGA TTGGAAACCC GGAGTTACCT GCTAGTTGGT GAAACTGGTT ' GGTAGACGCG 18 0 
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ATCTGTTGGC TACTACTGGC TTCTCCTCGC TGTTAAAAGC AGATGGTGGT TGAGGTTGAT 24 0 

TCCATGCCGG CTGCTTCTTC TGTGAAGAAG GCATTTGGTC TCAGGAGCAA GATGGGCAAG 3 00 

TGGTGCTGCC GTTGCTTCCC CTGCTGCAGG GAGAGCGGCA AGAGCAACGT GGGCACTTCT 3 60 

GGAGACCACG ACGACTCTGC TATGAAGACA CTCAGGAGCA AGATGGGCAA GTGGTGCCGC 420 

CACTGCTTCC CCTGCTGCAG GGGGAGTGGC AAGAGCAACG TGGGCGCTTC TGGAGACCAC 4 80 

GACGAYTCTG CTATGAAGAC ACTCAGGAAC AAGATGGGCA AGTGGTGCTG CCACTGCTTC 54 0 

CCCTGGTGCA GGGGGAGCRG CAAGAGCAAG GTGGGCGCTT GGGGAGACTA GGATGACAGT 6 00 

GCCTTCATGG AGCCGAGGTA CCA.CGTCCGT GGAGAAGATG TGGACAAGCT CCACAGAGCT 66 0 

GCCTGGTGGG GTAAAGTCCC CAGAAAGGAT CTCATCGTCA TGCTCAGG'.A CACTGACGTG 72 0 

AACAAGAAGG ACAAGCAAAA GAGGAGTGCT CTACATCTGG CCTCTGCC-\A TGGGAATTCA 7 80 

GAAGTAGTAA AACTCSTGCT GGACAGACGA TGTCAACTTA ATGTCCTTGA CAAGAAAAAG 84 0 

AGGACAGCTC TGAYAAAGGG GGTACAATGC CAGGAAGATG AATGTGCGTT AATGTTGCTG 900 

GAACATGGCA CTGATCCAAA TATTCCAGAT GAGTATGGAA ATACCACTCT RCACTAYGCT 960 

RTCTAYAATG AAGATAAATT AATGGCCAAA GCACTGCTCT TATAYGGTGC TGATATCGAA 102 0 

TCAAAAAACA AGGTATAGAT CTACTAATTT TATCTTCAAA ATACTGAAAT GCATTCATTT 1080 

TAACATTGAC GTGTGTAAGG GCCAGTCTTC CGTATTTGGA AGCTCAAGCA TAACTTGAAT 114 0 

GAAAATATTT TGAAATGACC TAATTATCTM AGACTTTATT TT/vAATATTG TTATTTTCAA 12 00 

AGAAGGATTA GAGGGTAGAG -pTTTTTTTTT TTAAATGGAG TTCTGGTAAA TACTTTTGTT 12 6 0 

GAAAAGAGTG AATTTGTAAA AGGTAATACT TACTATTTTT CAATTTTTCC GTGCTAGGAT 1320 

TTTTTTCCGC TAATGAATGT AA.GATGGCAA AATTTGGCCT GAAATAGGTT TTACATGAAA 13 80 

ACTCCAAGAA AAGTTAAACA TGTTTCAGTG AATAGAGATG CTGCTCGTTT GGCAAGTTGG 14 4 0 

TAAAAAACAG TAATAGATAC GAGGTGATGC GCCTGTCAGT GGGAA.GGTTT AAGATATTTC 1500 

TGATCTCGTG CO 1512 



(2) INFORMA.TION FOP. SEQ ID NO: 2.93: 

(i) SEOUENCE CHAJ^AGTER I ST I GS : 

(A) LENGTH: 1853 base pairs 

(B) TYPE: nucleic acid 

(C) STRAJNDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DECCRIPTION: SEQ ID NO: 2 95: 



GGGTCGCCCA GGGGGSGCGT GGGCTTTCCT CGGGTGGGTG TGGGTTTTCC CTGGGTGGGG 60 

TGGGCTGGGC TRGAATCCCC TGCTGGGGTT GGCAGGTTTT GGCTGGGATT GACTTTTYTC 12 0 

TTCAAACAGA TTGGAAACCC GGAGTTACCT GCTAGTTGGT GAAACTGGTT GGTAGACGCG 180 

ATCTGTTGGC TACTACTGGC TTCTCCTGGC TGTTAAAAGC AGATGGTGGT TGAGGTTGAT 24 0 

TCCATGCCGG CTGCTTCTTC TGTGAAGAAG GCATTTGGTC TCAGGAGCAA GAT(?GGCAAG 3 00 

TGGTGCTGCC GTTGCTTCCC CTGCTGCAGG GAGAGCGGCA AGAGCAACGT GGGCACTTCT 3 60 

GGAGACCACG ACGACTCTGC TATGAAGACA CTCAGGAGCA AGATGGGCAA GTGGTGCCGC 42 0 

CACTGCTTCC CCTGCTGCAG GGGGAGTGGC AAGAGCAACG TGGGCGCTTC TGGAGACCAC 4 80 

GACGAYTCTG CTATGAAGAC ACTCA.GG.AAC AAGATGGGCA AGTGGTGCTG CCACTGCTTC 54 0 

CCCTGGTGCA GGGGGAGCRG CAAGAGCAAG GTGGGCGCTT GGGGAGACTA CGATGACAGY 60 0 

GCCTTCATGG AKCCCAGGTA CCACGTCCFT GGAGAAGATG TGG7\CAAGCT CCACAGAGCT 6 60 

GCCTGGTGGG GTAAAGTCCC CAGAAAGGAT CTCATCGTCA TGCTCAGGGA CACKGAYGTG 72 0 

AACAAGARGG ACAAGCAAAA GAGGAGTGCT CTACATCTGG CCTCTGCCAJ\ TGGGAATTCA 780 

GA-AGTAGTAA AACTCSTGCT GGACAGACGA TGTCAACTTA ATGTCCTTGA CAACAAAAA.G 84 0 

AGGACAGCTC TGAYAAAGGG GGTACAATGC CAGGAAGATG .AATGTGCGTT AATGTTGCTG 90 0 

GAACATGGCA CTGATCCAAA TATTCCAGAT GAGTATGGAA ATACCA.CTCT RCAC1A.YGCT 960 

RTCTAYAATG AAGATAAATT AATGGCCAAA GCACTGCTCT TATAYGGTGC TGATATCGAA 102 0 

TCAAAAAACA AGCATGGCCT CACACCACTG YTACTTGGTR TACATGAGCA AAAACAGCAA 10 8 0 

GTSGTGAAAT TTTTAATYAA GAP-JVAAAGCG AATTTAAAAT GCRCTGGATA GATATGGAAG 114 0 

RACTGCTCTC ATACTTGCTG TATGTTGTGG ATCAGCAAGT ATAGTCAGCC YTCTACTTGA 1200 

GCAAAATRTT GATGTATCTT CTCAAGATCT GGAAAGACGG CCA.GAGAGTA TGCTGTTTCT 1260 
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AGTCATCATC ATGTAATTTG CCAGTTACTT TCTGACTACA AAGAAAAACA GATGTTAAAA 13 2 0 

ATCTCTTCTG AAAACAGCAA TCCAGAACAA GACTTAAAGC TGACATCAGA GGAAGAGTCA 13 80 

CAAAGGCTTA AAGGAAGTGA AAACAGCCAG CCAGAGGCAT GGAAACTTTT AAATTTAAAC 14 4 0 

TTTTGGTTTA ATGTTTTTTT TTTTTGCCTT AATAATATTA GATAGTCCCA AATGAAATWA 1500 

CCTATGAGAC TAGGCTTTGA GAATCAATAG ATTCTTTTTT TAAGAATCTT TTGGCTAGGA 156 0 

GCGGTGTCTC ACGCCTGTAA TTCCAGCACC TTGAGAGGCT GAGGTGGGCA GATCACGAGA 16 2 0 

TCAGGAGATC GAGACCATCC TGGCTAACAC GGTGAAACCC CATCTCTACT AAAAJ^^TACAA 16 8 0 

AAACTTAGCT GGGTGTGGTG GCGGGTGCCT GTAGTCCCAG CTACTCAGGA RGCTGAGGCA 174 0 

GGAGAJ^TGGC ATGAACCCGG GAGGTGGAGG TTGCAGTGAG CCGAGATCCG CCACTACACT 18 00 

CCAGCCTGGG TGACAGAGCA AGACTCTGTC TCAAAAAAAA' AAAAAAAAAA AAA 13 53 

(2) INFORMATION FOR SEQ ID NO: 2 96: 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 2184 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi')-. SEQUENCE DESCRIPTION: SEQ " ID NO: 296: 



GGCACGAGAA 


TTAAAACCCT 


CAGCAAAACTi^- 


"gGcatagaag 


GGACATACCT 


tAaagtaata 


60 


AAAACCACCT 


ATGACAAGCC 


CACAGCCAAC 


ATAATACTAA 


ATGGGGAAAA 


GTTAGAAGCA 


120 


TTTCCTCTGA 


GAACTGCAAC 


AATAAATACA 


aggatgctgg 


ATTTTGTCAA 


ATGCCTTTTC 


180 


TGTGTCTGTT 


GAGATGCTTA 


TGTGACTTTG 


CTTTTAATTC 


-TGTTTAT(5TG 


ATTATCACAl* 


240 


TTATTGACTT 


GCCTGTGTTA 


GACCGGAAGA 


gctggggtgt 


TTCTCAGGAG 


CCACCGTGTG 


300 


CTGCGGCAGC 


TTCGGGATAJ\ 


CTTGAGGCTG 


CATCAGTGGG 


GAAGAAACAC 


AYTCCTGTCC 


360 


GTGGCGCTGA 


TGGCTGAGGA 


CAGAGCTTCA 


GTGTGGCTTC 


TCTGCGACTG 


GCTTCTTCGG 


420 


GGAGTTCTTC 


CTTCATAGTT 


CATCCATATG 


GCTCCAGAGG 


AAAATTATAT 


TATTTTGTTA 


480 


TGGATGAAGA 


GTATTACGTT 


GTGCAGATAT 


-^ACTGCAGTGT 


CTTCATCTCT 


TGATGTGTGA 


540 


TTGGGTAGGT 


TCCACCATGT 


TGCCGCAGAT 


GACATGATTT 


CAGTACCTGT 


gtctggctga 


600 


AAAGTGTTTG 


TTTGTGAATG 


GATATTGTGG 


TTTCTGGATC 


TCATCCTCTG 


TGGGTGGACA 


660 


GCTTTCTCCA 


CCTTGCTGGA 


AGTGACCTGC 


TGTCCAGAAG 


TTTGATGGCT" 


gaggagtata 


720 


CCATCGTGCA 


TGCATCTTTC 


ATTTCCTGCA 


TTTCTTCCTC 


CCTGGATGGA 


CAGGGGGAGC 


780 


GGCAAGAGGA 


ACGTGGGCAC 


TTCTGGAGAC 


CACAACGACT 


CCTCTGTGAA 


gacgcttggg 


840 


AGCAAGAGGT 


GCAAGTGGTG 


CTGCCACTGC 


TTCCCGTaCT 


GGAGGGGAGC 


ggcaagagca 


900 


ACGTGGTCGC 


TTGGGGAGAC 


TACGATGACA 


GCGCCTTCAT 


•GGATCCCAGG 


taccacgtcc 


960 


ATGGAGAAOA 


TCTGGACAAG 


CtCCACAGAG 


CTGCGTGGTG 


GGGTAAAGTC 


CCCAGAAAGG 


1020 


ATCTCATCGT 


CATGCTCAGG 


GACACGGATG 


TGAACAAGAG 


GGACAAGCAA 


aagaggactg 


1080 


CTCTACATCT 


GGCCTCTGCC 


AATGGGAATT 


CAGAAGTAGT 


AAAACTCGTG 


ctggacagac 


1140 


GATGTCAACT 


TAATGTCCTT 


GACAACAAAA 


AGAGGACAGC 


TCTGACAAAG 


gccgtacaat 


1200 


GCCAGGAAGA 


TGAATGTGCG 


TTAATGTTGC 


TGGAACATGG 


CACTGATCCA 


aatattccag 


1260 


ATGAGTATGG 


AAATACCACT 


CTACACTATG 


CTGTCTACAA 


TGAAGATAAA 


ttaatggcca 


1320 


AAQCACTGCT 


CTTATACGGT 


GCTGATATCG 


AATCAAAAAA 


CAAGCATGGC 


ctcacaccac 


1380 


TGCTACTTGG 


TATACATGAG 


CAAJVAACAGC 


AAGTGGTGAA 


ATTTTTAATC 


aagaaaaaag 


1440 


CGAATTTAAA 


TGCGCTGGAT 


AGATATGGAA 


GAACTGCTCT 


CATACTTGCT 


gtatgttgtg 


1500 


GATGAGCAAG 


TATAGTCAGC 


CCTCTACTTG 


AG'CAAAATGT 


TGATGTATCT 


tctcaagatc 


1560 


TGGAAAGACG 


GCCAGAGAGT 


ATGCTGTTTC 


TAGTCATCAT 


CATGTAJ\TTT 


gccagttact 


1620 


TTCTGACTAC 


AAAGAAAAAC 


AGATGTTAAA 


AATCTCTTCT 


GAAAACAGCA 


atccagaaca 


1680 


AGACTTAAAG 


CTGACATCAG 


AGGAAGAGTC' 


ACAAAGGCTT 


AAAGGAAGTG 


AAAACAGCCA 


1740 


GCCAGAGGGA 


TGGAAACTTT 


TAAATTT AAA 


CTTTTGGTTT 


AATGTTTTTT 


ttttttgcct 


1800 


TAATAATATT 


AGAl'AGTCCC 


AAATGAAATW 


ACCTATGAGA 


CTAGGCTTTG 


agaatcaata 


1860 


GATTCTTTTT 


TTAAGAATCT 


■ TTTGGCTAGG 


AGCGGTGTCT 


CACGCCTGTA 


attccagcac 


1920 


CTTGAGAGGC 


TGAGGTGGGC 


AGATCACGAG 


ATCAGGAGAT 


CGAGACCATC 


ctggctaaca 


1980 


CGGTGAAACC 


CCATCTCTAC 


TAAAAATACA 


AAAACTTAGC 


TGGGTGTGGT 


ggcgggtgcc 


2040 
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TGTAGTCCCA GCTACTCAGG ARGCTGAGGC AGGAGAATGG CATGAACCCG GGAGGTGGAG 210 0 

GTTGCAGTGA GCCGAGATCC GCCACTACAC TCCAGCCTGG GTGACAGAGC AAGACTCTGT 216 0 

CTCAAAAAAA AAAAJ^JkAAPJ^ AAAA 2 184 

(2) INFORMATION FOR SEQ ID NO: 2 97: 

(i) SEQUENCE CHAR^.CTERI ST I CS : 

(A) LENGTH: 1855 base pairs 

(B) TYPE, nucleic acid 

(C) STRANDEDNES3 : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 97: 

TGCACGCATC GGCCAGTGTC TGTGCCACGT ACACTGACGC CCCCTGAGAT GTGCACGCCG 6 0 

CACGCGCACG TTGCACGCGC GGCAGCGGCT TGGCTGGCTT GTAACGGCTT GCACGCGCAC 12 0 

GCCGCCCCCG CAT;..:^CCGTC AGACTGGCCT GTAACGGCTT GCAGGCGCAC GCCGCACGCG 180 

CGTAACGGCT TGGCTGCCCT GTA.\CGGCTT GCACGTGCAT GCTGCACGCG CGTTAACGGC 2 •! 0 

TTGGCTGGCA TGTAGCCGCT TGGCTTGGCT TTGCATTYTT TGCTKGGCTK GGCGTTGKTY 3 00 

TCTTGGATTG ACGCTTCCTC CTTGGATKGA CGTTTCCTCC TTGGATKGAC GTTTCYTYTY 3 60 

TCGCGTTCCT TTGCTGGACT TGACCTTTTY TCTGCTGGGT TTGGCATTCC TTTGGGGTGG 4 20 

GCTGGGTGTT TTCTCCGGGG GGGKTKGCCC TTCCTGGGGT GGGCGTGGGK CGCCCCCAGG 4 80 

GGGCGTGGGC TTTC::CCGGG TGGGTGTGGG TTTTCCTGGG GTGGGGTGGG CTGTGCTGGG 54 0 

ATCCCCCTGC TGGGOTTGGC Ai^GGATTGAC TTTTTTCTTC AAACAGATTG GAAACCCGGA 6 00 

GTAACNTGCT AGTTGGTGAA ACTGGTTGGT AGACGCGATC TGCTGGTACT ACTGTTTCTC 660 

CTGGCTGTTA AAAGCAGATG GTGGCTGAGG TTGATTCAAT GCCGGCTGCT TCTTCTGTGA 72 0 

AGAAGCCATT TGGTCTCAGG AGCAAGATGG GCAAGTGGTG CGCCACTGCT TCCCCTGCTG 7 80 

CAGGGGGAGC CCCJ^J^.GAGCA ACGTGGGCAC TTCTGGAGAC CACAACGACT CCTCTGTGAA 84 0 

GACGCTTGGG AGCA_:.GAGGT GCAAGTGGTG CTGCCCACTG CTTCCCCTGC TGCAGGGGAG 9 00 

CGGCAAGAGC AACGTGGKCG CTTGGGGAGA CTACGATGAC AGCGCCTTCA TGGAKCCCAG 960 

GTACCACGTC CRTGGAGAAG ATCTGGACAA GCTCCACAGA GCTGCCTGGT GGGGTAAAGT 102 0 

CCCCAGAAAG GATCTCATCG TCATGCTCAG GGACACTGAY GTGAACAAGA RGGACAAGCA 108 0 

AAAGAGGACT GCTCTACATC TGGCCTCTGC CAATGGGAA.T TCAGAAGTAG TAAAACTCGT 114 0 

GCTGGACAGA CGATGTCAAC TTAATGTCCT TGACAACAAA AAGAGGACAG CTCTGACAAA 12 0 0 

GGCCGTACAA TGCCAGGAAG ATGAATGTGC GTTAATGTTG CTGGAACATG GCACTGATCC 12 60 

AAATATTCCA GATGAGTATG GAAATACCAC TCTACACTAT GCTGTCTACA ATGAAGATAA 13 20 

ATTAATGGCC AAAGCACTGC TCTTATACGG TGCTGATATC GAATCAAAAA ACAAGGTATA 13 80 

GATCTACTAA TTTTATCTTC AAAATACTGA AATGCATTCA TTTTAACATT GACGTGTGTA 14 4 0 

AGGGCCAGTC TTCCGTATTT GGAAGCTCAA GCATAACTTG AATGAAAATA TTTTGAAATG 15 00 

ACCTAATTAT CTAAGACTTT ATTTTAAATA TTGTTATTTT CAAAGAAGCA TTAGAGGGTA 15 60 

CAGTTTTTTT TTTTTAAATG CACTTCTGGT AAATACTTTT GTTGAAAACA CTGAATTTGT 162 0 

AAAAGGTAAT ACTTACTATT TTTCAATTTT TCCCTCCTAG GATTTTTTTC CCCTAATGAA 16 80 

TGTAAGATGG CAAAATTTGC CCTGAAATAG GTTTTACATG AAAACTCCAA GAAAAGTTAA 174 0 

ACATGTTTCA GTGAJ^TAGAG ATCCTGCTCC TTTGGCAAGT TCCTAAAAAA CAGTAATAGA 18 00 

TACGAGGTGA TGCGCCTGTC AGTdGCAAGG TTTAAGATAT TTCTGATCTC GTGCC 18 55 
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1. An isolated DNA molecule, comprising: 

(a) a nucleotide sequence selected from the group consisting of SEQ ID 
NO:l, 3-26, 28-77, 142, 143, 146-152, 154-166, 168-176, 178-192, 194-198, 200-204, 206, 
207, 209-214, 216, 218, 219, 221-240, 243-245, 247, 250, 251, 253, 255, 257-266, 268, 269, 
271-273, 275, 276, 278, 280, 281, 284, 288 and 291-297; 

(b) a valiant of said nucleotide sequence that contains one or more 
nucleotide substitutions, deletions, insertions and/or modifications at no more than 20% of 
the nucleotide positions, such that the antigenic and/or immunogenic properties of the 
polypeptide encoded by the nucleotide sequence are retained; or 

(c) a nucleotide sequence encoding an epitope of a polypeptide encoded by 
at least one sequence selected from the group consisting of SEQ ID NO:l, 3-26, 28-77, 142, 
143, 146-152, 154-166, 168-176, 178-192, 1 94- 19S, 200-204, 206, 207, 209-214, 216, 218, 
219, 221-240, 243-245, 247, 250, 251, 253, 255, 257-266, 268, 269, 271-273, 275, 276, 278, 
280, 28 1 , 284, 28S and 291-297. 

2. An isolated DNA molecule encoding an epitope of a polypeptide, 
wherein said polypeptide is encoded by a nucleotide sequence that: 

(a) hybridizes to a sequence selected from the group consisting of SEQ ID 
NO:l, 3-26, 28-77, 142, 143, 146-152, 154-166, 168-176, 178-192, 194-198, 200-204, 206, 
207, 209-214, 216, 218, 219, 221-240, 243-245. 247, 250, 251, 253. 255, 257-266, 268, 269, 
271-273, 275, 276, 278, 280, 281, 284, 288 and 291-297 under stringent conditions; and 

(b) is at least 80% identical to a sequence selected from the group 
consisting of SEQ ID NO:l, 3-26, 28-77, 142, 143, 146-152, 154-166, 16S-176, 178-192, 
194-198. 200-204, 206, 207, 209-214, 216, 218, 219, 221-240, 243-245, 247, 250, 251, 253, 
255, 257-266, 268, 269, 271-273, 275, 276, 278, 280, 281, 284, 288 and 291-297. 
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3. An isolated DNA molecule encoding an epitope of a polypeptide, 
wherein said poI>peptide is encoded by: 

(a) a nucleotide sequence transcribed from the sequence of SEQ ID NO: 

141;or 

(b) a variant of said nucleotide sequence that contains one or more 
nucleotide substitutions, deletions, insertions and/or modifications at no more than 20% of 
the nucleotide positions, such that the antigenic andy'or immunogenic properties of the 
polypeptide encoded by the nucleotide sequence are retained. 

4. An isolated DNA or RNA molecule comprising a nucleotide sequence 
complementary to a DNA molecule according to any one of claims 1-3. 

5. A recombinant expression vector comprising a DNA molecule 
according to any one of claims 1-3. 

6. A host cell transfomied or transfected with an expression vector 
according to claim 5. 

7. A polypeptide comprising an amino acid sequence encoded by a DNA 
molecule according to any one of claims 1-3. 

8. A polypeptide according to claim 7 wherein said polypeptide 
comprises an epitope of an amino acid sequence encoded by at least one nucleotide sequence 
selected from the group consisting of SEQ ID NO:l, 3-26, 28-77, 142, 143, 146-152, 154- 
166, 168-176, 178-192, 194-198, 200-204, 206, 207, 209-214, 216, 218, 219, 221-240, 243- 
245, 247, 250, 251, 253, 255, 257-266, 268, 269, 271-273, 275, 276, 278, 280, 281, 284, 288 
and 291-297 

9. A monoclonal antibody that binds to a polypeptide according to 

claim 7. 
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10. A method for determining the presence of breast cancer in a patient 
comprising detecting, within a biological sample, at least one polypeptide according to claim 
7, and therefrom determining the presence of breast cancer in the patient. 

11. A method for determining the presence of breast cancer in a patient 
compnsmg detecting within a biological sample, at least one polypeptide encoded by a 
nucleotide sequence selected from the group consisting of SEQ ID NO: 78-86, 144, 145, 153, 
167, 177, 193, 199, 205, 208, 215, 217, 220, 241, 242, 246, 248, 249, 252, 256, 267, 270, 
274, 277, 279, 282, 283, 285^287, 289, 290 and sequences that hybridize thereto under 
stringent conditions. 

12. The method of clamis 10 or 11 wherem the biological sample is a 
portion of a breast tumor. 

13. The method of claim 10 wherein the step of detecting comprises 
contacting the biological sample with a monoclonal antibody according to claim 9. 

14. The method of claim 11 wherein the step of detecting compnses 
contacting the biological sample with a monoclonal antibody that binds to a polypeptide 
encoded by a nucleotide sequence selected i^rom the group consisting of SEQ ID NO: 78-86, 
144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, 241, 242, 246, 248, 249, 252, 
256, 267, 270, 274, 277, 279, 282, 283, 285-287, 289, 290 and sequences that hybridize 
thereto under stringent conditions. 

; 15. A method for determining the presence of breast cancer in a patient 
comprising. detecung, within a biological sample, an RNA molecule encoding at least one 
polypeptide according to claim 7, and therefrom determining the presence of breast cancer in 
the patient. 
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lb. A method for determining the presence of breast cancer in a patient 
comprising detecting, within a biological sample, at least one RNA molecule encodmg a 
polypeptide encoded by a nucleotide sequence selected from the group consisting of SEQ ID 
NO: 78-86, 144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, 241, 242, 246, 248, 
249, 252, 256, 267, 270, 274, 277, 279, 282, 283, 285-287, 289, 290 and sequences that 
hybridize thereto under stringent conditions; and therefrom determining the presence of breast 
cancer in the patient. 

17. The method of claims 15 or 16 wherein the biological sample is a 
portion of a breast tumor 

18. The method of claim 15 wherein the step of detecting comprises: 

(a) preparing cDNA from RNA molecules within the biological sample; 

and 

(b) specifically amplifying cONA molecules that are capable of encoding 
at least a portion of a polypeptide according to claim 7, and therefrom determining the 
presence of breast cancer in the patient. 

19. The method of claim 16 wherein the step of detecting comprises: 

(a) preparing cDNA from RNA molecules within the biological sample; 

and 

(b) specifically amplifying cDNA molecules that are capable of encoding 
at least a portion of a polypeptide encoded by a nucleotide sequence selected from the group 
consisting of SEQ ID NO: 78-86, 144, 145, 153, 167, 177, 193, 199, 205,208,215,217,220, 
241, 242, 246, 248, 249, 252, 256, 267, 270, 274, 277, 279, 282, 283, 285-287, 289, 290 and 
sequences that hybridize thereto under stringent conditions; and therefrom determining the 
presence of breast cancer in the patient. 
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20. A method for monitoring the progression of breast cancer in a patient, 

comprising: 

(a) detecting an amount, in a biological sample, of at least one polypeptide 
accordmg to claim 7 at a first point in time; 

(b) repeating step (a) at a subsequent point in time; and 

(c) comparing the amounts of polypeptide detected in steps (a) and (b), 
and therefrom monitoring the progression of breast cancer in the patient. 

21. A method for monitoring the progression of breast cancer in a patient, 

comprising: 

(a) detecting in a biological sample an amount of at least one polypeptide 
at a first point in time, the polypeptide being encoded by a nucleotide sequence selected from 
the group consisting of SEQ ID NQ: 78-86, 144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 
217. 220, 241, 242, 246, 248, 249, 252, 256, 267, 270, 274, 277, 279, 282, 283, 285-287, 289, 
290 and sequences that hybridize thereto under stringent conditions; 

(b) repeating step (a) at a subsequent point in time; and 

(c) comparing the amounts of polypeptide detected in steps (a) and (b), 
and therefrom monitoring the progression of breast cancer in the patient. 

22. The method of claims 20 or 21 wherein the biological sample is a 
portion of a breast tumor. 

23. The method of claim 20 wherein the step of detecting comprises 
contacting a portion of the biological sample with a monoclonal antibody according to 
claim 9. 

24. The method of claim 21 wherein the step of detecting comprises 
contacting the biological sample with a monoclonal antibody that binds to a polypeptide 
encoded by a nucleotide sequence selected from the group consisting of SEQ ID NO: 78-86, 
144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, 241, 242, 246, 248, 249, 252, 
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256, 267, 270, 274, 277, 279, 282, 283, 285-287, 289, 290 and sequences that hybridize 
thereto under stringent conditions. 

25. The method of claim 20 wherein said polypeptide comprises an epitope 
of an amino acid sequence encoded by at least one nucleotide sequence selected from the 
group consisting of SEQ ID NO:l, 3-26, 28-77, 142, 143, 146-152, 154-166, 168-176, 178- 
192, 194-198, 200-204, 206, 207, 209-214, 216, 218, 219, 221-240, 243-245, 247, 250, 251, 
253, 255, 257-266, 268, 269, 271-273, 275, 276, 278, 280, 281, 284, 288 and 291-297. 

26. A method for monitoring the progression of breast cancer in a patient, 

comprising: 

(a) detecting an amount, within a biological sample, of at least one RNA 
molecule encoding a polypeptide according to claim 7 at a first point in time; 

(b) repeating step (a) at a subsequent point in time; and 

(c) comparing the amounts of R.NA molecules detected in steps (a) and 
(b), and therefrom monitoring the progression of breast cancer in the patient. 

27. The method of claim 26 wherein the step of detecting comprises: 

(a) preparing cDNA from RNA molecules within the biological sample; 

and 

(b) specifically amplifying cDNA molecules that are capable of encoding 
at least a portion of a polypeptide according to claim 7. 

28. A method for monitoring the progression of breast cancer in a patient, 

comprising: 

(a) detecting an amount, within a biological sample, of at least one RNA 
molecule at a first point in time, the RNA molecule encoding a polypeptide encoded by a 
nucleotide sequence selected from the group consisting of SEQ ID NO: 78-86, 144, 145, 153,-; 
167, 177, 193, 199, 205, 208, 215, 217, 220, 241, 242, 246, 248, 249, 252, 256, 267, 270, 
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274, 277, 279, 282, 283, 285-287, 289, 290 and sequences thai hybridize thereto under 
stnngent conditions; 

(b) repeating step (a) at a subsequent point in time; and 

(c) comparing the amounts of RNA molecules detected in steps (a) and 
(b), and therefrom monitoring the progression of breast cancer in the patient. 

29. - A pharmaceutical composition, comprising a polypeptide according to 
claim 7 and a physiologically acceptable carrier. 

30. A pharmaceutical composition for inhibiting the development of breast 
cancer, comprising a polypeptide and a physiologically acceptable carrier, the polypeptide 
being encoded by a nuclebtide seqaence 'sdiected from the group consisting of SEQ ID NO: 
78-86, 144, 145, 153, 167, 177, 193; 199, 205, 208, 2r5,'2f7, ^20, 241, 242, 246, 248, 249, 
252, 256, 267, 270; 274,-277, 279, 282, 283, 285-287, 289, 290 and sequences that hybridize 
thereto under stringent conditions. - 

31. A vaccine, comprising a polypeptide according to claim 7 and an 
immune, response enhancer. 

32. A vaccine, comprising a DNA molecule according to any one of 
claims 1-3. ■ 

33. A vaccine, comprising a recombinant expression vector comprising a 
DNA molecule according to any one of claims 1-3. 

34. : A vaccine for inhibiting the development of breast cancer, comprising 
a polypeptide and an immune response enhancer, the polypeptide being encoded by a 
nucleotide sequence Iselected frorn the group consisting of SEQ ID NO: 78-86, 144, 145, 153,'. 
167, 177,. 193, 199, 205, 20g, 215, 217, 220, 241, 242, 246, 248, 249, 252, 256, 267, 270, 
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274, 277, 279, 282, 2S3, 285-287, 289, 290 and sequences that hybndize thereto under 
stringent conditions. 

35. A phannaceutical con-iposition according lo either of claims 29 or 30, 
for use in the manufacture of a medicament for inhibiting the development of breast cancer in 
a patient, comprising administering to a patient. 

36. A vaccine according, lo any one of claims 31-34, for use in the 
manufacture of a medicament for inhibiting the development of breast cancer in a patient. 



37. A diagnostic kit comprising: 

(a) one or more monoclonal antibodies according to claim 9; and 

(b) a detection reagent. 



38. A diagnostic kit comprising: 

(a) one or more monoclonal antibodies that bind to a polypeptide encoded 
by a nucleotide sequence selected from the group consisting of sequences provided in SEQ 
ID 78-86, 144, 145, 153, 167, 177, 193, 199, 205,208,215,217, 220, 241,242 and 246, 248, 
249, 252, 256, 267, 270, 274, 277, 279, 282, 283, 285-287, 289, 290; and 

(b) a detection reagent. 

39. The kit of any one of claims 37 or 38 wherein the monoclonal 
antibody(s) are immobilized on a solid support. 

40. A diagnostic kit comprising two polymerase chain reaction prim.ers, at 
least one of the primers being specific for an RNA molecule according to claim 4. 

41. The kit of claim 40, wherein at least one of the polymerase chain 
reaction primers comprises at least about 10 contiguous nucleotides of an RNA molecule 
according to claim 4. 
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42. A diagnostic kit compnsing two polymerase chain reaction primers, at 
least one of the primers being specific for an RNA molecule encoding a polypeptide encoded 
by a nucleotide sequence selected from the group consisting of SEQ ID NOS:78"86, 144, 
145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, 241, 242 246, 248, 249, 252, 256, 267, 
270, 274, 277, 279, 282, 283, 285-287, 289 and 290. 

43. The kit of claim 42, wherein at least one of the polymerase chain 
reaction primers comprises at least about 10 contiguous nucleotides of an RNA molecule 

encoding a polypeptide encoded by a nucleotide sequence selccicd from the group consisting 
of SEQ ID NOS:7S-86, 144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, 241, 242 
246, 248, 249, 252, 256, 267, 270, 274, 277, 279, 282, 283, 285-287, 289 and 290. 

44. A diagnostic kit comprising at least one oligonucleotide probe, the 
oligonucleotide probe containing at least about 15 contiguous nucleotides of a DNA molecule 
according to claim 4. 

45. A diagnostic kit comprising at least one oligonucleotide probe, the 
oligonucleotide probe comprising at least about 15 contiguous nucleotides of a DNA 
sequence selected from the group consisting of SEQ ID NOS:78-86, 144, 145, 153, 167, 177, 
193, 199. 205, 208, 215, 217, 220, 241, 242 246, 248, 249, 252, 25^6, 267, 270, 274, 277, 279, 
282, 2S3, 285-287, 289 and 290. 

46. A diagnostic kit comprising at least one oligonucleotide probe specific 
for a DNA molecule according to claim 4. 

• 47.. The kit of claim 46, wherein the oligonucleotide probe comprises at 
least about 15 contiguous nucleotides of a DNA molecule according to claim 4. 

48. A diagnostic kit comprising at least one oligonucleotide probe specific 
for a DNA sequence selected from the group consisting of SEQ ID NOS:78-86, 144, 145, 
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153. 167, 177, 193, 199, 205, 208, 215, 217, 220, 241, 242 246, 248, 249, 252, 256. 267, 270. 
274, 277, 279, 282, 283, 285-287, 289 and 290. 

49. The kit of claim 48, wherein the oligonucleotide probe comprises at 
least about 15 contiguous nucleotides of a DNA sequence selected from the group consisting 
of SEQ ID NOS:78-86, 144, 145, 153, 167, 177, 193, 199, 205, 208, 215, 217, 220, 241, 242 
246, 248, 249, 252, 256, 267, 270, 274, 277, 279, 282, 283, 285-287, 289 and 290. 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B18Ag1 



TTA GAG ACC CAA TTG GGA CCT AAT TGG GAC CCA AAT TTC TCA AGT GGA 48 
Leu Glu Thr Gin Leu Gly Pro Asn Trp Asp Pro Asn Phe Ser Ser Gly 
15 10 15 

GGG AGA ACT TTT GAC GAT TTC CAC CGG TAT CTC CTC GTG GGT ATT CAG % 
Gly Arg Thr Phe Asp Asp Phe His Arq Tyr Leu Leu VqI Gly lie Gin 
20 25 30 

GGA GCT GCC CAG AAA CCl ATA AAC TTG TCT AAG GCG ATT GAA GTC GTC 144 
Gly Ala Ala Gin Lys Pro lie Asn Leu Ser Lys Ala lie Glu Val Vol 
35 40 45 

CAG GGG CAT GAT GAG TCA CCA GGA GTG TTT TTA GAG CAC CTC CAG GAG 192 
Gin Gly h.s Asp Glu Ser Pro Gly Val Phe Leu Glu His Leu Gin Glu 
50 55 60 

GCT TAT CGG ATT TAC ACC CCT TTT GAC CTG GCA GCC CCC GAA AAT AGC 240 
Ala Tyr Arg lie Tyr Thr Pro Phe Asp Leu Ala Ala Pro Glu Asn Ser 
65 70 75 80 

CAT GCT CTT AAT TTG GCA TTT GTG GCT CAG GCA GCC CCA GAT AGT AAA 288 
His Ala Leu Asn Leu Ala Phe Val Ala Gin Ala Ala Pro Asp Ser Lys 

85 90 95 

AGG AAA CTC CAA AAA CTA GAG GGA TTT TGC TGG AAT GAA TAC CAG TCA 336 
Arq Lys Leu Gin Lys Leu Glu Gly Phe Cys Trp Asn Glu Tyr Gin Ser 

100 ^ M05 ^ ^ no 

GCT TTT AGA GAT AGC CTA AAA GGT TTT 363 
Ala Phe A^^g Asp Ser Leu Lys Gly Phe 
115 120 



Fig. 6 
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NUCLEOTIDE SEQUCNE OE THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B17Ag1 



GC TGGGCACAGT GGCTCATACC TGIAATCCTG ACCGTTTCAG AGGCTCAGGT 60 

CG CTTGAGCCCA AGATTTCAAG ACTAGTCTGG GTAACATAGT GAGACCCTAT 120 

AA AAATAAAAAA ATGAGCCTGG TGTAGTGGCA CACACCAGCT GAGGAGGGAG 180 

CT 'AGGAGA 196 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B17Ag2 



GC 


TTGGGGGCTC 


TGACTAGAAA 


TTCAAGGAAC 


CTGGGATTCA 


AGTCCAACTG 


60 


AC 


TTACACTGTG 


GNCTCCAATA 


AACTGCTTCT 


TTCCTATTCC 


CTCTCTATTA 


120 


AA 


GGAAAACGAT 


GTCTGTG7A1 


AGCCAAGTCA 


GNTATCCTAA 


AAGGAGATAC 


180 


AT 


TAAATATCAG 


AATGTAAAAC 


CTGGGAACCA 


GGTTCCCAGC 


CTGGGATTAA 


240 


CA 


AGAAGACTGA 


ACAGTACTAC 


TGTGAAAAGC 


CCGAAGNGGC 


AATATGTTCA 


300 


TT 


GAAGGATGGC 


TGGGAGAATG 


AATGCTCTGT 


CCCCCAGTCC 


CAAGCTCACT 


360 


CT 


CCTTTATAGC 


CTAGGAGA 








388 



Fig. 8 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B13Ag2a 

GC CTATAATCAT GTTTCTCATT ATTTTCACAT TTTATTAACC AATTTCTGTT 60 

AA AATATGAGGG AAATATATGA AACAGGGAGG CAATGTTCAG ATAATTGATC 120 

Ti: ATTTCTACAT CAGATGCTCT TTGCTTTCCT GTT-TATTTGC TTTTTATTTC 180 

GG TCGAATGTAA TAGCTTTGTT TCAAGAGAGA GTTTTGGCAG TTTCTGTAGC 240 

GT GCTCATGTCT CCAGGCATCI.-ATJTGCACTT. TAGGAGGTGT CGTGGGAGAC 300 

CT ATTTTTTCCA TATTTGGGCA ACTACTA ' 337 

Fig. 9 
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NUCLEOTIDE SEOUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B13Ag1b 

GC CATACAGT6C CTTTCCATTT ATTTAACCCC CACCTGAACG GCATAAACTG 60 

GC TGGTGTTTTT TACTGTAAAC AATAAGGAGA CTTTGCTCTT CATTTAAACC 120 

AT TTCATATTTT ACGCTCGAGG GTTTTTACCG GTTCCTTTTT ACACTCCTTA 180 

IT TAAGTCGfTi GGAACAAGAT ATTTTTTCTT TCCTGGCAGC TTTTAACATT 240 

TT TGTGTCTGGG GGACTGCTGG TCACTGTTTC TCACAGTTGC AAATCAAGGC 300 

CC AAGAAAAAAA AATTTTTTTG TTTTATTTGA AACTGGACCG GATAAACGGT 360 

CG GCTGCTGTAT ATAGTTTTAA ATGGTTTATT GCACCTCCTT AAGTTGCACT 420 

GG GGGGNTTTTG NATAGAAAGT MTTTANTCAC ANAGTCACAG GGACTTTTNT 480 

NA CTGAGCTAAA AAGGGCTGNT TTTCGGGTGG GGGCAGATGA AGGCTCACAG 540 

TC TCTTAGAGGG GGGAACTNCT A 571 



Fig, 10 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B13Ag1a 

TA ATAACTTAAA TATATTTTGA TCACCCACTG GGGTGAJAAG ACAATAGATA 60 

TT TCCAAAAAGC ATAAAACCAA AGTATCATAC CAAACCAAAT TCATACTGCT 120 

CC GCACTGAAAC TTCACCTTCT AACTGTCTAC CTAACCAAAT TCTACCCTTC 180 

GG TGCGTGCTCA CTACTCTTTT TTTTTTTTTT TTTNTTTTEGAGATGGAGTC 240 

CA GCCCAGGGGT GGAGTACAAT tjGCACAACCT CAGCTXACTG NAACCTCCGC 300 

TT CATGAGATTC TCCTGNTTCA iGtUTTCCCAG TAGCTGGGAG TACAGGTGTG 360 

TG CCTGGNTAAT CTTTTTTNGT TTTNGGGTAG AGATGGGGGT TTTACATGTT 420 

TG: GTNTCGAACT CCTGACCTCA AGTGATCCAC, CCACGTCAGG . C JCCCAAAGT 480 

TACAGACATGAG CCACTGNGCC CAGNCCTGGT ,GCATGCTCAC TTCTCTAGGC 540 

548 



Fig. 11 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA BIIAgl 

TG CACATGCAGA ATATTCTATC GGTACTTCAG CTATTACTCA TTTTGATGGC 60 

AG CCTATCCTCA AGATGAGTAT TTAGAAAGAA TTGATTTAGC GATAGACCAA 120 

GC ACTCTGACTA CACGAAATTG TTCAGATGTG ATGGATTTAT GACAGTTGAT 180 

GA GATTATTAAG TGATTATTTT AAAGGGAATC CATTAATTCC AGAATATCTT 240 

TC AAGATGATAT AGAAATAGAA CAGAAAGAGA CTACAAATGA AGATGTATCA 300 

TA TTGAAGAGCC TATAGTAGAA AATGAATTAG CTGCATTTAT TAGCCTTACA 360 

TT TTCCTGATGA ATCTTATATT CAGCCATCGA CATAGCATTA CCTGATGGGC 420 

GA ATAATAGAAA CTGGGTGCGG GGCTATTGAT GAATTCATCC NCAGTAAATT 480 

AC AAAATATAAC TCGATTGCAT TTGGATGATG GAATACTAAA TCTGGCAAAA 540 

GG AGCTACTAGT AACCTCTCTT TTTGAGATGC AAAATTTTCT TTTAGGGTTT 600 

CT ACTTTACGGA TATTGGAGCA TAACGGGA 638 

Fig, 12 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B3CA3c 

ACTGATGGAT GTCuCCGGAG GCGAGGGGCC TTATCTGATG CTCGGCTGCC TGTTCGTGAT 60 

GTGCGCGGCG ATTGGGCTGT TTATCTCAAA CACCGCCACG GCGGTGCTGA TGGCGCCTAT 120 

TGCCTTAGCG GCGGCGAAGT CAATGGGCGT CTCACCCTAT CCTTTTGCCA TGGTGGTGGC 180 

GATGGCGGCT TCGGCGGCGT TTATGACCCC GGTCTCCTCG CCGGTTAACA CCCTGGTGCT 240 

TGGCCCTGGC AAGTACTCAT TTAGCGATTT TGTCAAAATA GGCGTG 286 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B9CG1 

AG CAGCCCCTTC TTCTCAATTT CATCTGTCAC TACCCTGGTG TAGTATCTCA 60 

CA TTTTTATAGC CTCCTCCCTG GTCTGTCTTT TGATTTTCCT GCCTGTAATC 120 

AC ATAACTGCAA GTAAACATTT CTAAAGTGTG GTTATGCTCA TGTCACTCCT 180 

AA ATAGTTTCCA TTACCGTCTT AATAAAATTC GGATTTGTTC TTTNCTATTN 240 

CA CCTATGACCG AA 262 



Fig. 14 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B9CG3 

AG CAAAGCCAGT GGTTTGAGCT CTCTACTGTG TAAACTCCTA AACCAAGGCC 60 

TA AATGGT6GCA GGATTTTTAT TATAAACATG TACCCATGCA AATTTCCTAT 120 

G^ TATATTCTTC TACATTTAAA CAATAAAAAT AATCTATTTT TAAAAGCCTA 180 

AG TTAGGTAAGA GTGTTTAATG AGAGGGTATA AGGTATAAAT CACCAGTCAA 240 

TG CCTATGACCG A 261 



Fig. 15 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B2CA2 



GG GCATGGACGC AGACGCCTGA CGTTTGGCTG AAAATCTTTC ATTGATTCGT 60 

AT AGGAAAATTC CCAAAGAGGG AATGTCCTGT TGCTCGCCAG TTTTTNTGTT 120 

GG ANAAGGCAAN GAGCTCTTCA GACTATTGGN ATTNTCGTTC GGTCTTCTGC 180 

CG NCTTGCNANG ATCTTCAT 208 



Fig. 16 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B3CA1 



GG GCATGGACGC AGACGCCTGA CGTTTGGCTG AAAATCTTTC ATTGATTCGT 60 

AT AGGAAAATTC CCAAAGAGGG AATGTCCTGT TGCTCGCCAG TTTTTNTGTT 120 

GG ANAAGGCAAN GAGCTCTTCA GACTATTGGN ATTNTCGTTC GGTCTTCTGC 180 

CG NCTTGCNANG ATCTTCAT 208 



^\SFig. 17 



BNSDOCID <WO 9B4532eA2 I > 



SUBSTITUTE SHEET (RULE 26) 



wo «)8/45328 



rCT/US98/06939 



18/22 



NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B3CA2 



GG GCATGGACGC AGACGCCTGA CGTTTGGCTG AAAATCTTTC ATTGATTCGT 60 

AT AGGAAAATTC CCAAAGAGGG AATGTCCTGT TGCTCGCCAG TTTTTNTGTT 120 

GG" ANAA3GCAAN GAGCTCTTCA GACTATTGGN ATTNTCGTTC GGTCTTCTGC 180 

CG NCTTGCNANG ATCTTCAT 208 



Fig. 18 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B3CA3 

AG GGAGCAAGGA GAAGGCATGG AGAGGCTCAN GCTGGTCCTG GCCTACGACT 60 

CT GTCGCCGGGG ATGGTGGAGA ACTGAAGCGG GACCTCCTCG AGGTCCTCCG 120 

TC'NCCGTCCAGG AGGAGGGTCT -TTGCGTGGTC TNGGAGGAGG -GGGGGGAGAA 180 

TC ATGGTCNACA TCCC 204 



Fig. 19 
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NUCLEOTIDE SEQUENCE OF THE REPRESENTATIVE 
BREAST-TUMOR SPECIFIC cDNA B4CA1 



TC aggagcgggt AGAGTGGCAC CATTGAGGGG ATATTCAAAA ATATTATTTT 60 

TG ATAGTTGCTG AGTTITTCTT TGACCCATGA GTTATATTGG AGTTTATTTT 120 

CC AATCriCAiGG ACATGTTAGA CTTAlTTTCT GTTAATGAT T NCTATTTTTA 180 

GA TTTGAGAAAT TGGTTNTTAT TATATCAATF TTTGGTATTT GTTGAGTTTG 240 

GC TTAGTATGTG ACCA 264 

Fig. 20 
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3. I I Claims Nos : 
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This international Searching Authori ty found ""Itiple (groups of) 
inventions in this international application, as follows. 

1 riaims- 1 2 4-10 12.13.15,17.18,20,22.23.25-27,29,31-33. 
1. Claims. I»|^^3j°^jtii^44,i6,47 (all partially) 



l"solftl3"DiA niolecule conpri si ng^a nucleotide ^sequ^^^ 
Sea ID 1. the corresponding polypeptide, an isolatedUNA or 
RNA'molecule comprising a nucleotide sequence complementary 
to SM ID 1. a>ecombinant expression vector comprising 
that llecule a host cell transfected with that vector, a 
- SnocroiarlntiU binding to the PolyP^Ptide enc^^^^^^^ 
Sea ID 1 as well as pharmaceutical compositions, vaccines. 
di2gnJstic S^thods used in the diagnosis and therapy of 
breast cancer. 

2. Claims: 1.2.4-10.12.13 15 17 18 20 22 23 25-27,2^ 
35-37,39-41.44.46,47 (all partially) 

IsotfteS^DN^'^lecule comprising a ""Sle^t^ide seque^^^ 
Sea ID 3. the corresponding polypeptide, an isolated una or 
RNA ™ieculS comprising a nucleotide sequence conpl^^Jtary 
to Sea ID 3. a recombinant expression vector comprising 
that TOlecul4 a host cell transfected with that vector, a 
ionJcTSnal antibody binding to the polypeptide encoded by 
monoc onai ^nti^p y ^^^^.g^tical compositions, vaccines, 

diagnostic Jilthods used in the diagnosis and therapy of 
breast cancer. 

...ibidem for each of sequences 4-26, 28-77 

3. Claims: 3 (completely) .4-ia p. 13 15 17.18.20.^^^ 

29.31-33.35-37.39-41.44.46,47 (all partially) 



}sraU°d"DNA^molecule conprising a nucleotide sequence with 
Seq ID 141 the corresponding polypeptide, an isolated una 
or RMA molecule comprising a nucleotide sequence 
JSmSlJnStlry to Seq. ID 141. a -^<^o"*i";"^,^JP:"|;°;^^ 
vector comprising that mo ecu e. a host £^11 transfected 
with that vector, a monoclonal antibody binding to tne 
polypeptide encoded by Seq. ID 141. as « ™.thods 
JhaSa?eutical compositions vaccines diagnostic methods 
used in the diagnosis and therapy of breast cancer. 

Claims: 1.2.4-10,12.13.15.17 18 20 22.23.25-27,29,31-33, 
35-37,39-41,44,46,47 (all partially) 
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Invention 77-200: 

Isolated DNA molecule comprising a nucleotide sequence with 
Seq.ID 142, the corresponding polypeptide, an isolated DNA 
or RNA molecule comprising a nucleotide sequence 
complementary to Seq. ID 142, a recombinant expression 
vector comprising that molecule, a host cell transfected 
with that vector, a monoclonal antibody binding to the 
polypeptide encoded by Seq. ID 142, as well as 
pharmaceutical connpositions, vaccines, diagnostic methods 
used in the diagnosis and therapy of breast cancer. 

...ibidem for each of sequences 

143,146-152,154-165,168-176,178-192,194-198,200-204,206,207,2 
09-214,216,218,219,221-240,243-245,247,250,251,253,255,257-26 
6,268,269,271-273,275,276,278,280,281,284,288 and 291-297 



5. Claims: 11,12,14,16,17,19,21,22,24,30,34,35,36,38,39,42,43, 
45,48,49 (all partially) 



Invention 201-241: 

Isolated DNA molecule comprising a nucleotide sequence with 
Seq.ID 78, the corresponding polypeptide, an isolated DNA or 
RNA molecule comprising a nucleotide sequence complementary 
to Seq. ID 78, a recombinant expression vector comprising 
that molecule, a host cell transfected with that vector, a 
monoclonal antibody binding to the polypeptide encoded by 
Seq. ID 78, as well as pharmaceutical compositions, 
vaccines, diagnostic methods used in the diagnosis and 
therapy of breast cancer. 

...ibidem for each of sequences 

78-86,144,145,153,167,177,193,199,205,208,215,217,220,241,242 
,246,248,249,252,256,267,270,274,277,279,282,283,285-287,289, 
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